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EXPERIMENTAL ALTERATIONS IN THE EFFECTIVENESS 
OF VERBAL REINFORCEMENT COMBINATIONS 


ALEXANDER M. BUCHWALD 


Indiana University 


The experiments reported in this 
paper deal with response acquisition 
under three reinforcement combina- 
tions, RW, RN, and NW. The 
events R, W, and N consist of E 
saying ‘‘Right’’ (R), ““Wrong’’ (W), 
or nothing (N) after S’s response. 
Use of a particular combination means 
that the first named event follows all 
correct responses and the second all 
incorrect responses. In several ex- 
periments (Buchwald, 1959; Buss, 
Braden, Orgel, & Buss, 1956; Buss & 
Buss, 1956) involving various re- 
sponse-selection tasks, and in which 
the instructions included no references 
to, learning, correct responses, or the 
events R, W, and N, Ss learning under 
RN have learned more slowly than 
other Ss, and differences between Ss 
learning under RW and those learning 
under NW have been minimal. 

Explanations of these results have 
involved various hypotheses concern- 
ing the reinforcement values of the 
events R, W, and N. (The term 
reinforcement value refers to the effect 
of a particular event used as a rein- 
forcer, i.e., to a parameter which 
determines the change in response 
probability when that event follows a 


response.) In one explanation (Buss, 
Braden, Orgel, & Buss, 1956; Buss & 
Buss, 1956) it was assumed that R is 
a very weak positive reinforcer, W a 
much stronger negative reinforcer, and 
that N has a zero reinforcement value. 
In a second explanation (Buchwald, 
1959) it was assumed that when S 
is exposed to either RN or to NW, N > 
acquires a reinforcement value oppo- 
site in direction to that of the event 
with which it is combined, and that 
the value acquired by N in the NW 
combination exceeds that which it 
acquires in the RN _ combination. 
These explanations account for acqui- 
sition data equally well, but the second 
one also led to correct predictions 
concerning the effects of an extinction 
procedure following acquisition under 
the various combinations (Buchwald, 
1959). 


EXPERIMENT | 


Experiment I was designed to test 
several of the assumptions concerning 
reinforcement values. In contrast to 
earlier (single combination) experi- 
ments in which each S was exposed 
to only one reinforcement combina- 
tion, Exp. | was a “multiple combina- 
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tion’’ experiment since each S was 
exposed to all three combinations in 
a single series of trials. This was 
accomplished by assigning each com- 
bination to a different pair of response 
alternatives, with all of these pre- 
sented in a single series of trials. 

This procedure makes possible a 
test of the different assumptions about 
the reinforcement value of N. Under 
the assumption that N maintains a 
zero reinforcement value the multiple 
combination procedure is essentially 
similar to a single combination pro- 
cedure. Hence the results of this 
experiment should be similar to those 
of earlier experiments. However, un- 
der the assumption that N acquires 
a reinforcement value when S is 
exposed to RN or to NW, the present 
results should differ from thoge of 
earlier experiments. 

Since Ss are exposed to R, W, and 
N under conditions which make it 
difficult to discriminate the events 
assigned to various response alterna- 
tives, it is reasonable to assume that 
N will acquire a positive value for 
some Ss, a negative value for other 
Ss, and no reinforcement value for 
still others. These occurrences will 
tend to retard response acquisition 
under RN, under NW, and under 
both, respectively, as compared to a 
single combination situation. Fur- 
ther, the hypothesis that N acquires 
greater reinforcement value under 
exposure to NW than under exposure 
to RN implies that the present pro- 
cedure will retard response acquisi- 
tion under NW to a greater extent 
than it will affect acquisition under 
RN. Therefore, in the present experi- 
ment, acquisition under both RN and 
NW should be slower (relative to 
that under RW) than in earlier experi- 
ments, and (unless N acquires a posi- 
tive reinforcement value for the great 
majority of Ss) the previously found 
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superiority of response acquisition 
under NW, compared to RN, should 
be lessened or should disappear. 

To test these hypotheses and the 
assumptions which underlie them, the 
acquisition data obtained under the 
various combinations will be com- 
pared in the same respects as those 
in an earlier single combination study 
(Buchwald, 1959), and these re- 
sults compared with those of single 
combination experiments. 

Data obtained in the present experi- 
ment can also be used to test hypothe- 
ses concerning the relative reinforce- 
ment values of R and W. These 
hypotheses can be tested by estimat- 
ing learning parameters for these 
events since under the present experi- 
mental conditions the relative values 
of these parameters will depend upon 
differences in the effectiveness of the 
reinforcing events. Since the present 
experimental arrangements are similar 
to those to which the mathematical 
model developed by Estes (1957b) 
applies, this model can be used to 
estimate the reinforcement param- 
eters and to provide theoretical curves 
based on these estimates. The fit 
of these curves to the trial data can be 
used to test the hypotheses in question. 


Method 


Subjects.—The Ss were 36 students en- 
rolled in introductory psychology courses.! 

Materials.—The materials, which have 
been more fully described in an earlier paper 
(Buchwald, 1959), consisted of three packs, 
each containing 216 3 X 5-in. cards. A pair 
of 40% association value (Hilgard, 1951) 
nonsense syllables placed one above the other, 
and a three-digit number, appeared on each 
card. Three different syllable pairs and nine 
different numbers were used. For any one 
pack three different numbers were assigned 
to each syllable pair, with the assignments for 


The author wishes to acknowledge the 
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different packs being made according to the 
principles of a latin square. All numbers ap- 
peared equally frequently in each pack; each 
member of a syllable pair appearing equally 
often in the upper and lower position with 
each of its assigned numbers. Each pack was 
separately randomized with the restrictions 
that each block of three cards contain all 
syllable pairs, and each block of 18 cards con- 
tain one of each nonidentical card in the pack. 
For each pack a different syllable pair was 
assigned to each reinforcement combination 
with each of the syllables in a pair being 
correct for half of the Ss. 

Procedure.—The Ss were randomly as- 
signed to card packs, correct-response syl- 
lables, and one of two E’s, subject to restric- 
tions which balanced all of these in a factorial 
design. 

Each S was run individually. The S was 
seated across a table from E and told that 
this was “a coding task’ and that for each 
card he was to respond with the one of the two 
syllables shown which “goes with the number 
on the card.’ He was further informed that 
initially he might have to guess, but that “‘it 
will become clear to you as we go.”’ A ver- 
batim account of the instructions appeared in 
an earlier paper (Buchwald, 1959). 

Each of the 216 cards was exposed to S 
one at a time. Following each response E 
took the action indicated below, recorded the 
response, and exposed the next card. 

Following every response E said “Right” 
(R), or “Wrong” (W), or said nothing (N) 
according to the following plan: RW syllable 
pair, R for correct, W for incorrect responses ; 
NW syllable pair, N for correct, W for in- 
correct responses; RN syllable pair, R for 
correct, N for incorrect responses. The in- 
dicated reinforcement event occurred follow 
ing every appropriate response made by 5. 
The various reinforcers were contingent only 
on the syllable chosen for the response, the 
numbers being merely irrelevant stimuli. 


Results and Discussion 


Inspection of the data indicated 
that differences due to card packs, 
correct-response syllables, and F's, 
were small or nonexistent. Since 
these variables are of no interest in 
the present connection they were 
disregarded in the following analyses. 

Comparison of data for the three 
combinations.—The number of incor- 
rect responses given on each syllable 


TABLE 1 


SUMMARY STATISTICS FOR NUMBER OF 
INCORRECT RESPONSES UNDER THE 
Various COMBINATIONS (72 
TRIALS Per COMBINATION) 


Reinforcement | 
Combination 


Mean Mdn SD Range 


RN 17.22 | 8.0 | 21.01 | 

NW 10.36 | 7.0 

RW 6.17) 3.0 | 
: | 


0-71 


10.52 | 1-52 
6.14 | 0-22 
| 


pair was counted for each S. Sum- 
mary statistics for this variable are 
shown in Table 1. As the statistics 
in Table 1 suggest, these data are 
badly skewed, particularly in the case 
of the RN pair. 

Over-all differences were tested 
using Friedman's analysis of variance 
by ranks (Siegel, 1956) and were 
found to be significant at the .01 
level. The sign test applied to differ- 
ences between the RW pair and each 
of the other pairs was also significant 
at the .01 level. However, the same 
test applied to the RN vs. NW data 
was not significant at the .05 level. 
To help guard against a Type II 
error the Wilcoxon test for paired 
replicates (Siegel, 1956), and the ¢ 
test for paired measures, were also 
applied to the RN vs. NW data with 
the same lack of significance resulting. 
Since consideration of intra-individual 
data shows a median difference of 
three fewer incorrect responses for 
RW than for either RN or NW, and 
a median difference of zero for RN vs. 
NW, these results are not surprising. 
Note that the mean difference in 
number of incorrect responses between 
the RN and NW pairs does not stem 
from a majority of Ss making more 
incorrect responses on the RN pair. 
Seventeen of the Ss made more incor- 
rect responses on the RN pair than 
they did on the NW pair, 16 Ss made 
fewer incorrect responses on the RN 
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TABLE 2 


SUMMARY STATISTICS FOR Last TRIAL ON 
Wuicu AN INCORRECT RESPONSE 


OccurkED UNDER THE VARIOUS 
COMBINATIONS 


ae —— 
Mean Mdn | SD 

3| 14.5 | 28.3 | 0-72 

31.5 | 21.0 | 2-72 

07.5 | 16.0 | 0-57 


Range 


pair, and for 3 Ss there was no differ- 
ence. In contrast, 25 of 36 Ss made 
more incorrect responses on the RN 
pair than on the RW pair, and the 
same number made more incorrect 
responses on the NW pair than on the 
RW pair. 

As a second response measure the 
last trial on which an incorrect re- 
sponse occurred was tabulated for 
each syllable pair. Summary statis- 
tics for this variable are shown in 
Table 2. Again the data are markedly 
skewed. 

Over-all differences for last incor- 
rect trial are significant at the .01 
level (Friedman test). The last in- 
correct trial occurred earlier on the 
RW pair than on either of the other 
pairs (both at the .01 level using the 
sign test), and earlier on the RN pair 
than on the NW pair (at the .05 level 
using the sign test). For individual 
Ss the median last incorrect trial 
occurred 9 trials earlier on the RN 
pair than on the NW pair, and 13.5 
trials earlier on the RW pair than on 
the NW pair. 

These data can be summarized as 
follows. The bulk of the Ss made 
fewest incorrect responses, and had 
their last incorrect trial earliest, on 
the RW pair. About half of the Ss 
made fewer incorrect responses on the 
RN pair than on the NW pair, while 
for the other Ss this pattern was 
The Ss who made 


reversed. more 


incorrect responses on the RN pair 
than they did on the NW pair tended 
to make many more incorrect re- 
sponses on the RN pair, but for all 
Ss the last incorrect trial tended to 
occur earlier on the RN pair than on 
the NW pair. 


When these results are compared with 
those of single combination studies 
several differences are apparent. First, 
the present finding that significantly 
more incorrect responses occur on the 
NW pair than on the RW pair is counter 
to the results of single combination ex- 
periments. In the three such experi- 
ments (Buchwald, 1959; Buss, Braden, 
Orgel, & Buss, 1956) in which error 
measures were used no such difference 
was found. Second, the present finding 
that the last incorrect trial occurred later 
on the NW pair than on the RN pair is 
unparalleled in single combination ex- 
periments. Previous studies using this 
measure, or comparable trials to criterion 
measures (Buchwald, 1959; Buss & Buss, 
1956) have yielded the opposite result. 
In fact in five sets of data obtained from 
single combination studies there has been 
no finding of superior acquisition under 
RN, as compared to NW, on any response 
measure. Both of these differences are 
in the direction predicted on the basis of 
the assumption that N acquires reinforce- 
ment value during response acquisition 
under RN or under NW. 

The first of the differences mentioned 
may, of course, be due to the inherently 
greater sensitivity of the multiple com- 
bination procedure where each S serves 
as his own control. But this argument 
concerning sensitivity should apply 
equally well to the difference between 
the RN and NW pairs. There is no 
reason why the present design should be 
more sensitive in one regard than the 
other. 

On the basis of the foregoing it seems 
reasonable to conclude that the present 
results do deviate from single combina- 
tion results in ways which are (grossly) 
predictable from the assumption that N 
acquires reinforcement value when S is 
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exposed to it as one of a combination of 
reinforcement events. Taken together 
with the earlier finding (Buchwald, 1959) 
that in a single combination experiment 
a typical extinction procedure (N for all 
responses) yields practically no extinction 
unless S has been exposed to RN during 
acquisition, these results provide strong 
support for the acquisition-of-reinforce- 
ment-value hypothesis. 

Comparison of the data shown in 
Tables 1 and 2 suggests that there is some 
divergence of the two response measures 
used in this experiment. The data for 
individual Ss reveal this more strikingly ; 
many Ss made more incorrect responses 
on the RN pair and had a later last 
incorrect trial on the NW pair. The 
reverse pattern did not occur. This sug- 
gests that there is some aspect of the 
procedure which leads to the occurrence 
of incorrect NW responses relatively late 
in the series of trials. Consideration of 
the procedure in the light of the assump- 
tion that the reinforcement value of N 
is dependent upon the reinforcement 
event which is an alternative to it sug- 
gested the following explanation. When 
S no longer makes incorrect responses 
on the NW and RW pairs, all responses 
are followed by either Ror N. If S has 
not discriminated the different reinforce- 
ment contingencies associated with the 
several pairs, subsequent trials will be 
similar to those under a single combina- 
tion RN procedure and N will acquire a 
negative reinforcement value. Since this 
negative reinforcer will follow all correct 
responses on the NW pair, the prob- 
ability of these responses will decrease. 
If the series continues for enough trials 
S will produce an incorrect response on 
the NW pair. The occurrence of this 
isolated event, incorrect response—W, 
may lead to a correct discrimination of 
the reinforcement contingencies and 
therefore to a cessation of incorrect re- 
sponses on the NW pair. 

It follows from this explanation that 
the number of trials intervening between 
the next-to-last and the last incorrect 
responses should be greater for the NW 
pair than for the other pairs. The data 
are summarized in Table 3. The data 


TABLE 3 


SUMMARY STATISTICS FOR NUMBER OF TRIALS 
INTERVENING BETWEEN THE Last Two 
INCORRECT RESPONSES UNDER 
THE VARIOUS COMBINATIONS 


Mean | | Mdn | SD »| Range | 


Combination 


Reinforcement ont | 
® 
| 
| 


2. 48 
| 10.06 
4.55 | 


* Number of Ss pre at least two incorrect re 
sponses on that pair. Data from Ss not having two 
incorrect responses were not used in calculating entries 
in this table. 


differ among the three pairs at the 01. 
level of significance (Friedman test), with 
the NW differences being larger than 
those for RN and for RW, at the .001 
and .02 levels, respectively (sign test). 
These results are not merely an artifact 
of the differing numbers of incorrect re- 
sponses since only data from Ss with at 
least two incorrect responses on each 
pair are included, and since similar results 
were obtained when Ss for whom the 
next-to-last incorrect trial occurred very 
late in the trial series were excluded. 

Estimation of parameters and curve 
fitting. —Estes and his associates have 
shown that a simple equation fits trial 
data obtained from experimental situa- 
tions in which S is informed after each 
response as to whether or not it was 
correct (Estes, 1957a, 1957b; Estes & 
Straughan, 1954). The derivation of 
this equation involves the assumption 
that P, (the probability of occurrence 
of the correct response on the nth trial) 
does not depend upon whether a correct, 
or an incorrect, response occurred on the 
n— 1 trial. For the present RW pair 
this is tantamount to the assumption 
that R and W have equally large rein- 
forcement values with opposite sign. 

The applicable equation (Estes, 1957b) 
is 


P,=1—(1—P)(1—0)" [1] 


where P; is the probability of correct 
responses prior to the first trial, and @ is 
the learning parameter. By estimating 
P, separately, it is possible to estimate 6 
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PROPORTION CORRECT RESPONSES 





"4 





‘ec e*eseereecwene ee ee @ oh 88 78 
3 TRIAL BLOCKS 

Fic. 1. Theoretical curves and empirical 
data for acquisition under RW. The solid 
line represents the curve with 6g = Ow; the 
dashed line represents the estimated curve 
with 02 # Ow; the solid circles represent the 
empirical data. 


from the mean number of incorrect re- 
sponses in a series of N trials.? Using 
P, = .62 (the proportion of initial re- 
sponses on all pairs which were correct) 
yields as an estimate, 0 = .0615. These 
values were used in Equation 1 to 
produce the solid curve shown together 


with the empirical RW data in Fig. 1. 
For purposes of comparison, separate 

estimates of the relative reinforcement 

parameters for R(@z) and for W(@w) were 


made. These estimates do not involve 
any assumption of equality of strength 
for R and W but they do require the 
assumption that the reinforcement value 
of N is zero in the present experiment. 
The estimate of 0g was obtained from the 
mean number of incorrect responses over 
the 72 RN trials by use of equation 9.20 
in Bush and Mosteller (1955, p. 198). 
The obtained value is .0278. Substitut- 
ing this value in Bush and Mosteller’s 
equation 9.20 (p. 198) yielded an esti- 
mate of Ow = .2587, based on RW data. 
This pair of values corresponds to her 
hypothesis that W is a much strongte 


? By algebraic manipulation and summa- 
tion of both sides of Equation 1 
noN 
[1 


-( 
z= (1 — P,) = (1 — P,) 
nel 0 


~ 0) J 


The left hand side is the expected number of 
incorrect responses in N trials. 


reinforcer than is R. Since there is no 
known equation which is feasible to use 
to compute a theoretical RW curve from 
this pair of values, a theoretical curve 
was estimated by a Monte Carlo method 
designated as the use of ‘‘statrats’’ (Bush 
& Mosteller, 1955, pp. 129-131). The 
dashed curve shown in Fig. 1 represents 
the mean probability of correct responses 
obtained by running 10 statistical Ss, 
each with P,; = .62, 0g = .0278, and 
Ow = .2587, and behaving according to 
Estes’ (1957b) model.* 

Inspection of Fig. 1 suggests that the 
equal @ curve provides a somewhat better 
fit to the data than the alternative curve. 
Deviations of empirical data from the 
former curve are smaller on 20 of the 24 
trial blocks than are the deviations from 
the second curve. Differences between 
actual and theoretical proportions range 
from .0005 to .0721 for the equal 6 curve 
and from .0022 to .1261 for the latter 
curve, with the mean deviations being 
.0158 and .0250, respectively. Note in 
Fig. 1 that the fit of the unequal @ curve 
tends to be poorest where the two 
theoretical curves are most divergent. 
This is borne out by the results obtained 
by correlating differences between the 
two theoretical curves with deviations of 
empirical data from the unequal @ curve. 
The rank-difference correlation of .51 
(significant at .02) indicates that the 
unequal @ curve tends to fit the data best 
when the unequal @ curve lies close to the 
equal 6 curve. By contrast, the corre- 
sponding correlation for the equal @ curve 
is only .16 (nonsignificant). While these 
various indications of better fit are sug- 
gestive they do not seem sufficiently 
strong to warrant rejecting the hy- 
pothesis of unequal reinforcement values 
for R and W. Application of the runs 
test (Bush & Mosteller, 1955, p. 214) to 
direction of deviations of empirical data 
from theoretical curves (used as a test 
of goodness of fit) does not allow rejection 


* The responses consisted of entries from 
a table of random numbers. When the 
nth entry for an S was no larger than 
Pa, Pasi = (1 — Or) P, + Or. When the nth 
entry was larger than P,, Pay: = (1 — Ow) Pn 
+ Ow. 





EFFECTIVENESS OF VERBAL REINFORCEMENT 


of the null hypothesis at the .05 level for 
either curve. 

Therefore, the results of this experi- 
ment (a) confirm the hypothesis that N 
acquires reinforcement value when S is 
exposed to this event in single combina- 
tion procedures, (b) suggest that in a 
multiple combination procedure N may 
vary in value, perhaps in both directions, 
during the series of trials, and (c) are in 
accordance with, but at best weakly 
confirm, the hypothesis that R and W 
are equally potent reinforcers. 


EXPERIMENT II 

Experiment II was designed pri- 
marily to provide an additional test 
of the hypothesis that an originally 
neutral event, N, acquires reinforce- 
ment value when it is an alternative 
to some other event which already 
possesses a nonzero reinforcement 
value. Because this hypothesis is a 


relatively new one, it seems desirable 
to test it in a number of different 


ways. 

There were four successive sets of 
trials. In Set I, “original learning,” 
S was exposed to the task used in 
Exp. I under the RW combination 
until he reached a learning criterion. 
In Set II, “induction training,” S 
was exposed to a revised version of 
this task under either NW, or RN. 
In Set III, S was tested for retention 
of the originally learned responses 
and given retraining if necessary. 
In Set IV, ‘reversal learning,” S 
was given a series of trials on the 
original material, under multiple com- 
bination conditions, but with the 
responses which were correct in origi- 
nal learning now incorrect, and vice 
versa. 

Sets I and III were intended to 
bring the probability of correct re- 
sponses to zero (or near this figure) 
at the start of reversal learning. 
Induction training was used to intro- 
duce the experimental variables, rein- 
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forcement combination used in induc- 
tion training, and number of trials 
of induction training. If the hypoth- 
esis that N acquires reinforcement 
value is correct, these variables will 
affect performance during reversal 
learning. 

This design obviously requires that 
the task and materials used in induc- 
tion training differ from those used 
in reversal learning. However, pre- 
liminary work suggested that if the 
tasks were made too dissimilar the 
reinforcement value acquired by N 
might not apply in reversal learning. 
To meet this difficulty, a revised 
version of the task used in Exp. I was 
devised for use in induction training. 


Method 


Subjects.—The Ss were 84 introductory 
psychology students.‘ None of these had 
previously served in an experiment of this 
type. 

Materials.—The materials used for all 
parts of this experiment except for induction 
training were identical with those used in 
Exp. I. 

Three new packs of cards were constructed 
for use in induction training. The format of 
these cards was identical with that of the 
original cards. Six different syllable pairs 
were used in the new packs. For any S three 
of these pairs would consist of pairs of the 
three syllables which were correct in original 
learning, and three would consist of pairs of 
the three syllables which were incorrect in 
original learning. Each new pack was used 
in conjunction with only one of the original 
packs. These two packs were related in that 
a number assigned to a given syllable pair in 
the new pack had not appeared with either 
member of that pair in the original pack. 
Each of the new packs contained 48 cards 
arranged in random order subject to the 
restriction that each of the six syllable pairs 
appear once in each block of six cards. 

Procedure.—The Ss were randomly as- 
signed to groups, card packs, and correct 
responses in original learning, subject to 


‘The author wishes to acknowledge the 
assistance of Muazzez O. Poffenbarger who 
ran these Ss. 
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restrictions which jointly balanced card packs 
and correct responses over groups. 

The instructions and basic procedure were 
identical with those used in Exp. I. The 
different parts of the experiment were run off, 
without any break, in the following order: 

Original learning utilized the same pro- 
cedure for all Ss. All correct responses were 
followed by R, all incorrect responses by W. 
Training was continued until S had given 
correct responses to all cards in an 18-card 
block. Only one S had to be discarded be- 
cause he failed to reach this criterion within 
the previously set limit of 162 trials. 

Induction training consisted of 12, 24, or 48 
trials, with R for correct responses and N for 
incorrect responses (Groups R12, R24, R48), 
or with N for correct responses and W for in- 
correct responses (Groups W12, W24, W48). 
Group 0 received no induction training. 
During induction training each syllable was 
correct in one of the two syllable pairs in 
which it occurred, and incorrect in the other. 
For example, DIB was correct when paired 
with ZEW, and incorrect when paired with 
MOH. 

Retention and retraining were carried out 
for all Ss except those in Group 0. The S 
responded until he had reached a criterion of 
three consecutive correct responses. Each S 
was run under the reinforcement combination 
to which he had been exposed during ipduc- 
tion training. 

Reversal learning was identical for all Ss 
except that the assignment of syllable pairs 
to reinforcement combinations was different 
for each card pack. During reversal learning, 
responses which had been incorrect during 
original learning were correct, and vice versa. 
The procedure was identical with that for 
Exp. I except that S was run for 48 trials on 
each of the three syllable pairs. 
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Results and Discussion 


The number of incorrect responses 
made in original learning was subjected 
to analysis of variance to determine 
whether there were differences in the 
performance of the various groups 
prior to the introduction of the experi- 
mental treatments. All of the F 
ratios obtained from a simple analysis 
of variance on all seven groups, and 
from a 2 X3 analysis on the six 
groups given induction training, were 
less than unity. 

Summary statistics for number of 
incorrect responses during reversal 
learning are shown in Table 4. The 
data of primary interest are differ- 
ences in the number of incorrect 
responses on the RN pair and on the 
NW pair. The statistical analysis 
was carried out on the square root 
of these differences since these vari- 
ances were homogeneous while the 
variances of the untransformed dif- 
ferences were not. A 2 X 3 analysis 
of variance (Group 0 was omitted 
from this and subsequent analyses 
except where explicitly mentioned) 
applied to these data yielded sig- 
nificant F ratios for reinforcement 
combination used in induction train- 
ing (P <.01), and for interaction 
(P < .05), but not for number of 
trials of induction training. 


TABLE 4 


NuMBER OF INCORRECT RESPONSES DURING REVERSAL LEARNING 











Group 


Ez | 
> | 





R12 
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EFFECTIVENESS OF VERBAL REINFORCEMENT 


To ascertain whether these effects 
stemmed from experimentally pro- 
duced differences in performance on 
the RN pair, or on the NW pair, 
or on both, analyses were also per- 
formed on the variables, NW-RW 
(number of incorrect NW responses 
minus number of incorrect RW re- 
sponses), and RN-RW (number of 
incorrect RN responses minus number 
of incorrect RW _ responses). The 
RW data were used in this way in 
order to remove the effects of indi- 
vidual differences in speed of reversal 
from the NW, and RN data, and thus 
to decrease within-cell variances. 

The NW-RW data were subjected 
to a 2 X 3 analysis of variance which 
yielded a significant F ratio only for 
reinforcement combinations used in 
induction training (P < .05). The 
RN-RW differences could not be 
subjected to analysis of variance since 
the variances were heterogeneous and 


it appeared unlikely that any trans- 
formation would lead to homogeneity. 
Therefore these data were tested by 
means of a nonparametric test (Wil- 


son, 1956). The Wilson test yielded 
significant x? values for reinforcement 
combinations (P < .01), and for inter- 
action (P < .001). None of these 
results can be attributed to differences 
in performance on the RW pair since 
a 2 X 3 analysis of variance of the 
RW data yielded no F greater than 
unity. 

These results can be summarized 
as follows. In general, the number 
of incorrect RN responses exceeded 
the number of incorrect NW re- 
sponses, but these differences were 
greater for Ss given induction training 
under NW than for those given in- 
duction training under RN. The 
extent to which pairs of groups given 
identical numbers of induction train- 
ing trials differed in this respect, was 
a positive function of number of in- 
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duction training trials; the difference 
between groups R12 and W12 being 
virtually nonexistent. While all of 
these differences were largely due to 
the effect of induction training on 
performance on the RN pair, differ- 
ences in performance on the NW pair 
also made some contribution. 

Data consisting of the last trial 
on which an incorrect response oc- 
curred were also subjected to sta- 
tistical analysis. The Wilson test 
was used for all tests on these data 
because of the extreme skewness of 
the data obtained from some of the 
groups. The data generally resemble 
the incorrect response data and since 
only minor differences in significance 
levels resulted, this material will not 
be reported. 

In all of the above analyses data 
obtained from Group 0 have been 
omitted. It is apparent from Table 
4 that Ss in Group 0 tended to make 
fewer incorrect responses on all syl- 
lable pairs than Ss in the other groups. 
Examination of data from individual 
Ss discloses that 6 of the 12 Ss in 
Group 0 made no more than two 
incorrect responses on any of the three 
syllable pairs, while only 4 of the 72 
other Ss performed at this same level. 
These differences suggest that reversal 
learning immediately after original 
learning may differ qualitatively from 
reversal learning following a series of 
induction training trials. For this 
reason, no use is made of the data 
from Group 0. 


The results of this experiment indicate 
that exposure to either RN or to NW 
affects the later course of learning under 
these reinforcement combinations. In 
general, the results are consistent with 
the hypothesis that a neutral event will 
acquire a reinforcement value opposite 
in sign to that of an event to which it is 
an alternative. However there are sev- 
eral aspects of the data which do not 
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seem to fit neatly within this inter- 
pretation. Two of these, the superiority 
of performance under NW in Groups 
R12 and W12, and differences in the 
extent to which induction training af- 
fected performance’ under NW and 
performance under RN, can be explained 
by considering the experimental pro- 
cedure. 

From the near identity of the data 
obtained from Groups R12 and W12, it 
can be concluded that 12 trials of induc- 
tion training were insufficient to produce 
a nonzero reinforcement value for N (at 
least for most Ss). When N has not 
acquired reinforcement value, its occur- 
rence will not affect response probability 
(Buchwald, 1959). But since most re- 
sponses made at the start of reversal 
learning will be incorrect, and since N 
follows incorrect RN responses while W 
follows incorrect NW responses, the 
probability of correct NW responses will 
be increased much more frequently than 
that of correct RN responses. Thus, a 
greater number of incorrect RN re- 
sponses can be expected to occur over the 
entire series of trials. 


These considerations also provide part 
of the explanation for the fact that 
induction training affected RN perform- 
ance to a greater extent than it affected 


NW performance. Early in _ reversal 
learning the reinforcement value of N 
has only a minimal effect upon NW 
performance, but it is the chief determi- 
nant of RN performance. Further, when 
N has a positive reinforcement value the 
probability of correct RN responses can 
remain at zero throughout the series of 
trials since the occurrence of each in- 
correct response serves to prevent the 
occurrence of correct responses. But 
even if N should have a negative rein- 
forcement value equal to that of W, the 
probability of correct NW responses can 
be expected to approach .5. (This is 
the most extreme case since it is reason- 
able to assume that W will have at least 
as large a negative value as N.) How- 
ever, even in this case learning may 
proceed since the frequent alternation of 
N and W to members of the same syllable 
pair should serve to reduce, or even to 
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reverse, the negative value of N. From 
these considerations it is apparent that 
the expected number of incorrect RN 
responses when N has a positive value is 
much greater than the expected number 
of incorrect NW responses when N is a 
negative reinforcer. But when N has 
reinforcement values which facilitate 
reversal learning under each of the com- 
binations, the expected numbers of RN 
and NW incorrect responses are virtually 
identical. Hence, differences among 
groups in RN performance should be 
greater than differences in NW per- 
formance. 

Finally, the discrepant results obtained 
from Group R48 remain to be considered. 
The discrepancies are twofold; namely, 
the failure to manifest as great an effect 
of induction training as was shown by 
Group R24, and the fact that the number 
of incorrect RN responses is not less than 
the number of NW responses. Any 
explanations of these results which might 
be offered at this stage would be unduly 
speculative. However, it should be noted 
that, despite the discrepancies, differ- 
ences between R48 and all other groups 
are consonant with the hypothesis being 
investigated. 


SUMMARY 


Two experiments designed to test various 
hypotheses about the reinforcing effects of 
three verbal events were performed. The 
verbal events, R, W, and N, consisted of E 
saying “Right” (R), “Wrong” (W), or 
nothing (N) after Ss response. These were 
used in combinations, RW, RN, NW, in which 
the first named event followed all correct, 
and the second all incorrect responses. The 
instructions made no reference to these events. 

In Exp. I, 36 Ss were presented with 72 
trials on each of three syllable pairs inter- 
mingled in a single series. Each of these 
pairs was assigned to one of the three rein- 
forcement combinations. The results were 
as follows: (a) Fewest errors occurred on the 
RW pair while the other two pairs were not 
differentiated in terms of number of errors. 
(b) The last error occurred earliest on the RW 
pair, and next earliest on the RN pair. 
(c) The number of trials intervening between 
the last and next-to-last error was greatest 
for the NW pair. 
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In Exp. II, seven groups of 12 Ss first 
learned to select one of each pair of syllables 
to mastery under RW. Six groups were then 
given either 12, 24, or 48 trials on a similar 
task, with three of the groups (R12, R24, 
R48) having this induction training under 
RN and three (W12, W24, W48) under NW. 
The seventh group, Group 0, had no induc- 
tion training. All Ss were then exposed to 
48 trials on each syllable pair under conditions 
similar to those used in Exp. I except that the 
originally learned responses were now to be 
reversed. Reversal learning results revealed 
that induction training under RN led to fewer 
RN errors and more NW errors than did 
induction training under NW, and that the 
effect of induction training on number of RN 
errors and differences in number of RN and 
NW errors depended on the number of 
induction training trials. Group 0 made 
fewer errors on all pairs than did the other 
groups. 

The results of these experiments seem to 
confirm the hypothesis that during exposure 
to RN, N becomes a negative reinforcer, 
while. during exposure to NW, N becomes a 
positive reinforcer. There were also indica- 
tions that the assumption that R and W are 
equally potent reinforcers may be slightly 
superior to the assumption that R is a much 
weaker reinforcer than W. 


REFERENCES 


Bucuwa.p, A. M. Extinction after acquisi- 
tion under different verbal reinforcement 


361 


combinations. 
43-48. 

Bus, R. R., & MosTetLter, F. Stochastic 
models for learning. New York: Wiley, 
1955. 

Buss, A. H., Brapen, W., OrGEL, A., & Buss, 
E. H. Acquisition and extinction with 
different verbal reinforcement combina- 
tions. J. exp. Psychol., 1956, $2, 288-295. 

Buss, A. H., & Buss, E. H. The effect of 
verbal reinforcement combinations on 
conceptual learning. J. exp. Psychol., 
1956, 52, 283-287. 

Estes, W. K. Of models and men. 
Psychologist, 1957, 12, 609-617. (a) 

Estes, W. K. Theory of learning with 
constant, variable, or contingent prob- 
abilities of reinforcement. Psychometrika, 
1957, 22, 113-132. (b) 

Estes, W. K., & STRAUGHAN, J. H. Analysis 
of a verbal conditioning situation in terms 
of statistical learning theory. J. exp. 
Psychol., 1954, 47, 225-234. 

Hitcarp, E. R. Methods and procedures in 
the study of learning. In S. S. Stevens 
(Ed.), Handbook of experimental psy- 
chology. New York: Wiley, 1951. Pp. 
517-567. 

S1eceL, S. Nonparametric statistics for the 
behavioral sciences. New York: McGraw- 
Hill, 1956. 

Wilson, K. V. A distribution-free test of 
analysis of variance hypotheses. Psychol. 
Bull., 1956, 53, 96-101. 


J. exp. Psychol., 1959, 57, 


Amer. 


(Received July 2, 1958) 











of Experimental Psychology 


Journal 
Vol. 57, No. 6, 1959 


GUIDANCE IN PURSUIT 


TRACKING ! 


D. H. HOLDING 


Nuffield Department of Industrial Health, King's College, Newcastle upon Tyne, 
University of Durham, England 


One of the prime difficulties in 
learning theory has always been how, 
in the first place, to get a given re- 
sponse to occur. The obvious solu- 
tion, where the trainer literally guides 
S through the correct response, has 
received comparatively little experi- 
mental attention. One of the dif- 
ficulties is that, unless S is set to 
attend to and identify himself with 
the externally manipulated response, 
the learning situation will be poorly 
structured for reinforcing the correct 
response. Thus, with animal Ss, 
Thorndike (1898) failed to teach a 
string-pulling problem by guidance; 
and while Maier (1949) had some 
success in reorienting fixated rats 
by guidance, Haslerud (1951) later 
showed mechanical maze guidance to 
be less effective in changing existing 
habits than in establishing new ones. 

The situation is clearer with human 
Ss, where the appropriate set may 
be established by instructions. How- 
ever, interest in guidance as a human 
training technique has tended to 
lapse since the early maze-learning 
studies of Carr and his associates 
(1930), Waters (1930) and Melcher 
(1934), although these results were 
promising. Von Wright (1957) has 
recently shown that one of the defects 
of these experiments is the lack of 
information about incorrect alterna- 
tives at choice points; where this 


! This work was carried out on behalf of 
the Flying Personnel Research Committee of 
Great Britain, under R. C. Browne. The 
author wishes to thank D. J. Newell and Doris 
Weightman for statistical advice and compu- 
tation, and N. E. Loveless for constructive 
discussion. 


information is given, it is possible 
to bring a task to full learning by 
guidance alone. A more serious de- 
fect is that maze learning is barely 
a skill, in that it requires less fine 
motor adjustment, and more rote 
learning of the task stimuli, than 
does a continuous adjustive skill 
like pursuit tracking. The purpose 
of the present series of experiments 
is to evaluate mechanical guidance 
as a training technique in a continuous 
sensorimotor skill, where it may be of 
use as a form of synthetic training. 


METHOD 


Apparatus.—The task was a form of that 
described by Graham, Baxter, and Browne 
(1951), providing pursuit tracking on a 
vertical scale by means of a rotary control, 
and modified to give automatic following when 
necessary. 

The target pointer moved in a random se- 
quence between 21 points spaced from 0 to 
20 cm. up the vertical scale, reaching each 
point once during one cam revolution of .5 
min. (Course A). Ah alternative Course B 
consisted of movement between the same 
positions in a different permutation. Speed 
of displacement between the positions was 
constant at about 9-cm./sec., with intervals 
between displacements of about .6 sec. The 
control was a 6-cm. rotary knob to the right 
of the display, with a ratio requiring move- 
ment round its circumference equal to the 
pointer displacement. 

The error between target course and re- 
sponse was recorded by an integrator and 
counter, via the output shaft of a differential. 
To administer guidance, the differential was 
locked, and a train of compensating gears was 
inserted; the result was to give normal! be- 
havior to the display accompanied by accurate 
tracking movements of the control and its 
associated pointer. 

Subjects.—Male students and young uni- 
versity staff were used. They were drawn 
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from various faculties, although most were 
medical. 

General procedure.—Trials were in 2-min. 
tracking periods, separated by 3-min. rest 
pauses. All Ss beginning unguided tracking 
were allowed to handle the control before 
their first trial, to prevent artificially high 
error scores due to ignorance of the gear ratio 
and of the clockwise-for-up display-control 
relationship. The score was announced at 
the end of every scored trial, and took the 
form of integrated error-time units. 

For guidance trials, the following pointer 
and the control knob were locked to the 
target course generator. The Ss were told 
that the machine would track itself perfectly, 
and warned that they would be expected to 
do it on their own later on. They were then 
asked to grip the control knob lightly, allow- 
ing the knob to turn their hand; they were also 
asked to watch the pointers during the run. 
In all conditions the run was started with the 
pointer in one of its middle positions. 


EXPERIMENT | 


The main object of Exp. I was to discover 
at what point, on the learning curve of Ss 
practicing normally on Course A, lay the 
scores of Ss who had received guidance. A 
control group (A) therefore performed three 
trials on Course A, while another group was 
tested for 2 min. on A after a 2-min. guidance 
run. These relatively short training trials 
were adopted in order to place the control 
scores on the steepest part of the learning 
curve, as it was essential to the design that 
these scores should differ significantly. Pre- 
liminary work had shown that the learning 
curve on this apparatus was comparatively 
flat. 

Since the target cam repeated its rotation 
during a run, there was a theoretical possi- 
bility that rote learning of the target course 
would occur. A second control group (AB) 
was therefore transferred to Course B after 
one trial on A; while an equivalent guidance 
group (AB) was tested on B after guidance 
on A, to obviate the effects of specificity. 
A third normal-practice group (B) was given 
three trials on Course B, to provide evidence 
for any difference in difficulty between 
the courses which might complicate the 
comparison. 

The 40 Ss were therefore allocated to five 
groups of eight, in order of appearance. 


Results.—The scores in all experi- 
ments were transformed logarithmi- 
cally. Course B showed a marked 
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tendency towards greater difficulty 
than Course A, but analysis of vari- 
ance of the six trials by the Control 
A and Control B groups failed to 
confirm the difference at the 5% 
level. 

The scores for Guidance A fell at 
about the same point on the curve 
as those for Trial 2 of Control A, 
and an analysis of variance was carried 
out on Control A Trial 1 versus Guid- 
ance A and Control A Trial 2. This 
showed that there was no significant 
difference between the guidance test 
trial and the control Trial 2, while 
the control Trial 1 was significantly 
different (P < .05) from the other 
two; it was concluded that one guided 
trial was approximately equal to a 
normal voluntary 2-min. training 
period. 

Reference to other data suggests 
that the amount learned during one 
2-min. trial on this apparatus is quite 
large. The mean error for Trial 1 
of Control A Ss works out at approxi- 
mately 7.7 mm., improving to 6.5 mm. 
on the second trial.2 An estimate 
of the total learning curve, obtained 
from four Ss who were able to return 
on other days to complete nine trials 
each, suggests that the final average 
error is unlikely to be less than about 
4.6mm. The benefit due to guidance 
(at 6.6 mm.) is thus over one-third of 
the estimated total learning. 


TABLE 1 


GUIDANCE AS LEARNING (Exp. 1) 





Source 





Control A 1 vs. A 2 and 
Guidance A 

Between A 2 and Guidance 
A 


1 sass 5.39 | 0S 


0008 | 0.16 | 
0051 | 





? The mean error at the display, in mm., 
is approximately 1/10th of the antilog of the 
scores used in the analyses. 
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TABLE 2 
GUIDANCE IN A-B TRANSFER 
(Exp. 1) 
Source |a|ms| r | P 
Control B 1 vs. B 2, AB 2 | : 
and Guidance AB | 2 | 0161 | 2.27 | >.05 
Between B 2, AB 2, and 
Guidance AB 2 | 0010) 0.14 
Residual 28 | 0071 
Total | a 


| | 





There was no significant difference 
between the Control AB Trial 1, 
and the Guidance AB and Control 
AB Trial 2; but this would not be 
expected if both were transferring 
to a more difficult course. Trial 1 
of Control B was tested against the 
three scores falling at about the same 
point on the B curve—Controls B and 
AB Trial 2, and Guidance AB— 
with nonsignificant results. There 
was no significant difference between 
the last three, and it may be tenta- 
tively assumed that the transferred 
guidance was equivalent in training 
value to a normal training trial, 
although the evidence is not con- 
clusive. 

As a further check on the value 
of guidance, the test trials for Guid- 
ance A and AB were pooled for testing 
against Trial 2 scores of Controls A 
and AB. There was no significant 
difference (F = .4, 1 and 29 df). 
This was taken to support the efficacy 
of guidance. 


EXPERIMENT I] 


As the evidence in Exp. | for transfer 
of guidance relied on nonsignificance, and 
therefore did not constitute proof, the transfer 
trials were repeated in the reverse direction. 
A further series of 20 Ss was divided into two 
equal groups. The control group practiced 
on Course B and subsequently transferred to 
A, while the experimental group was guided 
on B and subsequently tested on A. 


Results.—The guided Ss transferred 
at least as well to Course A as did the 
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control Ss. A difference between the 
guided test trial and the second con- 
trol trial, in favor of the guided Ss, 
was not significant. There was a 
highly significant difference between 
the first control trial and the guidance 
test and second control trials. 

The absolute amount transferred 
from B to A (Control A Trial 1 minus 
Control BA Trial 2) was greater than 
the amount transferred from A to B 
(Control B Trial 1 minus AB Trial 2), 
but in the absence of full learning 
curves for Course B it is not possible 
to state that difficult-to-easy transfer 
was superior to easy-to-difficult trans- 
fer. 


EXPERIMENT III 


This experiment attempts to assess the 
relative contributions of the kinesthetic and 
visual components of guidance, as a first 
stage in further analysis. 

Eighteen Ss in one experimental group 
were blindfolded during the guidance run, 
receiving only kinesthetic information from 
the rotation of the knob. A further group 
of 18 Ss folded their arms while they watched 
the pointers tracking in unison; they were 
not prevented from watching the rotating 
control. 

To confirm the findings of Exp. I, and to 
form a baseline for comparison with the 
kinesthetic and visual groups, control and 
full guidance groups of 10 Ss each were again 
included. 


All runs were on Course A. 


Results.— Analysis of variance show- 
ed that it was unobjectionable to com- 
bine the scores of control and full 
guidance Ss in Exp. III with those 


TABLE 3 


GUIDANCE IN B-A TRANSFER 
(Exp. Il) 


Source | df MS F P 
Control BA 1 vs. BA 2 and | * 
Guidance BA | 1 | 0851 | 16.26) .001 
Between BA 2 and | 


Guidance BA 1 | .0089 | 1.71 
Residual 27 | .0052 
Total | 29 | 
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TABLE 4 


VISUAL AND KINESTHETIC GUIDANCE 
(Exp. I & III) 











Source |a| us| F | p 
Kinesthetic effect | 4 | 0077 | 1.74 | 
Visual effect 1 | .0334| 7.56 | .01 
K-V Interaction | 1 | 0010) 0.23 
Residual 68 | .0045 | 


Total 71 


of the equivalent groups in Exp. I. 
With the combined groups of 18 Ss 
in each category, the difference be- 
tween the first control trial and the 
guidance test trial emerged as sig- 
nificant (P < .01). The difference 
between the second control trial and 
the guidance test trial remained not 
significant. 

Using the combined data, the 
improvement over the first control 
trial due to full guidance was ap- 
proximately 10 mm., while that due 
to visual guidance was 6 mm., and 
kinesthetic guidance 2 mm. of mean 
error. The data were treated fac- 
torially; the visual effect was signifi- 
cant (P < .01), while neither the 
kinesthetic effect, nor the kinesthetic- 
visual interaction was significant. 


DiscussiON 

In this kind of tracking an amount of 
guidance has an effect roughly equal to 
a similar amount of normal practice, 
early in training, even though normal 
learning was encouraged by pretest 
familiarization and knowledge of score. 
The work has been restricted to early 
training; it is extremely difficult to prove 
the effect of guidance after greater 
amounts of learning, since the chances 
that points further down the learning 
curve will differ significantly 
successively reduced. 


become 


It is far from clear that the effects 
of guidance are limited to the early 
stages of learning; Carr's (1930) con- 
clusions on the 


relative decrease in 
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effectiveness of larger amounts of guid- 
ance are not confirmed by Waters (1930). 
Obviously, as in any other case of trans- 
fer, the effectiveness of guidance will 
vary with its degree of simulation of 
the final task, and this will affect its 
efficiency more in the later stages of 
training. In any synthetic training 
device, overlearning on the trainer tends 
to impede transfer to the real task. In 
any case, where a decrease is relative 
to a large initial advantage, the early 
benefit will be of use in many training 
situations, where the early, fumbling, 
stages of learning are to be avoided. 
Where an operation may be performed 
in several different ways, guidance has 
the practical advantage of obviating 
the fixation of inefficient techniques; 
again, where a task to be learned con- 
flicts with the existing repertoire of S, 
as in learning an adverse display-control 
relationship, guidance may have the 
advantage. 

There is no reason to think that guid- 
ance will not be effective for a 
range of pursuit tasks. The amount of 
control movement which is usually 
necessary is well above the level for 
accurate discrimination of passive move- 
ment, which tends to be below 1° of 
movement under most conditions (Cleg- 
horn & Darcus, 1952). On the other 
hand, it may be predicted that the bene- 
fit on a compensatory task will be slight, 
as compensatory guidance would mimic 
the kinesthetic condition of Exp. III. 

The failure of the kinesthetic effect 
to reach significance was probably due 
to the fact that the guidance procedure 
was S's first experience of the tracking 
task, and it seems likely that it would 
have benefited from 
preliminary tracking Poulton's 
(1957) comparison of obscuring the 
input or the response pointer shows 
clearly that knowledge of task require- 
ments is prior to knowledge of the means; 
while Lincoln's (1956) moderate success 
in teaching rates of movement, 
interpolated kinesthetic examples, is 
that the contribution of 
kinesthetic guidance need not be nil. 


wide 


insertion after a 


trial. 


with 


an indication 
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‘The success of the visual technique is in 
accordance with both Melcher’s (1934) 
findings and those of Twitmyer (1931). 

Since, in the full guidance case, learn- 
ing by passive movement does take place, 
it is no longer possible to hold that motor 
learning cannot occur without knowledge 
of results. The model of S as a servo, 
attaining motor adjustments through 
successive mismatches, does not fit the 
guidance procedure. We must obvi- 
ously, with Annett and Kay (1957), 
distinguish between knowledge of results, 
as the feedback component of the servo 
model of skill, and extraneous knowledge 
of score. To the extent that knowledge 
of results is equated with error feedback, 
two differences are obvious in the case 
of guidance: there is no error to feed 
back, since the automatic tracking is 
perfect; and there is no feedback, since 
the kinesthetic stimuli of guidance are 
not in fact fed back as the result of any 
efferent quantities. 

The theory that knowledge of results 
is indispensable conceals the distinction 
between knowledge of results and knowl- 
edge of the correct response. What 


seems to be necessary for the acquisition 
of motor skill is the occurrence of the 
correct response; and knowledge of re- 
sults is merely an incidental, though 
common, means of achieving the correct 
response. 


SUMMARY 


Three experiments on pursuit tracking 
were conducted with 116 Ss. The apparatus 
was modified so that guidance could be 
administered mechanically. 

It was found that guidance at an early 
stage produced learning to about the same 
extent as did normal practice. This was the 
case when a single target course was used; 
and when Ss transferred learning on a first 
target course to tracking a second. Transfer 
was carried out in both directions between 
the tasks. Full guidance was more effective 
than visual guidance ; and visual guidance was 
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superior to kinesthetic guidance, which gave 
no significant benefit in this instance. 

These results suggest a reinterpretation 
of the place of knowledge of results in the 
theory of motor skill, and a new evaluation 
of guidance as a training technique. 
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Perception of postural verticality 
as a function of learning has received 
little primary attention (Fleishman, 
1953; Passey & Guedry, 1949) until 
recently when Solley (1956) presented 


positive evidence of the effect of such , 


learning. This was in the form of a 
clearly defined negatively accelerated 
curve and was obtained by plotting 
the magnitude of perceptual error for 
30 successive trials of lateral tilt. 

Of particular interest to the present 
authors is the fact that following each 
trial judgment Ss were returned 
from their judged vertical position 
to the true vertical and told they were 
then “upright.” Clearly, such pro- 
cedure would seem to provide some 
degree of knowledge of results (KR) 
and, equally clear, is the possibility 
that this might have been largely 
responsible for the rate and degree 
of learning achieved. 

The present study therefore was 
designed to determine the contribu- 
tion made by KR presented in this 
proprioceptive manner, the compara- 
tive effectiveness of KR presented by 
visual means, and the nature of the 
interaction of these presumed effects 
with alternation of the direction of 
lateral tilt. 


METHOD 


A pparatus.—The lateral-tilt chair and its 
essential components have been described 
elsewhere (Pearson, 1957). A shutter me- 
chanism, an integral part of the chair, was 
used only in conjunction with the visual KR 
device shown in Fig. 1. This device, wall- 
mounted opposite the shutter, consisted of 


a transparent disk pinned at its center to a 
fixed black board. On the latter was painted 
an error scale, true vertical, and horizon 
indicators. The perceived vertical indicator 
was painted on the plastic disk which, when 
rotated by E, clearly indicated the magnitude 
of departure from true vertical. The paint, 
a luminous substance, was energized under 
red light. The S's control panel, modified 
for the present study, contained adjustable 
leather sleeve cuffs to support S’s forearms 
at tilt positions and a three-position switch 
providing S with directional control of the 
chair which, of course, was also under the 
independent control of E. 

The test room was finished with a flat 
black paint and was completely light proof. 
Both red and white light were available to E 
at his discretion. Temperature, humidity, 
and ventilation were adequately controlled 
within the room. 

Design.—The experimental conditions were 
arranged in conventional factorial design 
involving two variables, KR and alternation 
of tilt. The former consisted of the following 
subclasses: KR,;—no KR (control) in which 
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Fic. 1. Visual KR apparatus. 
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S was moved directly to tilt following his 
trial judgment; KR»—postural KR in which 
S was moved to the true vertical following 
each judgment; KR,—visual KR in which S 
was shown a visual representation of his 
position following each judgment. The sec- 
ond variable consisted of A;—24 trials at 30° 
left tilt; Az—24 trials alternately 30° left 
and 30° right tilt. The Ss were assigned in 
equal numbers and randomly to each of the 
six cells. 

Procedure.—The S was secured in the 
tilting chair following adjustment of the seat 
to his sitting height, given detailed instruc- 
tions, and told he would receive 24 trials. 
No mention was made of the actual degree 
of tilt. If S had no questions, testing then 
commenced. The E moved S to a position 
of tilt, then, following a delay period of 30- 
sec., transferred control of the chair to S 
who made his judgment. When satisfied 
with his position, S responded “OK” or 
“All right." The E then recorded the direc- 
tion and amount of error. Delay at the 
perceived vertical was 20 sec. For the visual 
KR Ss, E, after setting the KR device, opened 
the shutter for 5 sec. during the latter half of 
the delay period. In the case of the postural 


Gnas @ KR CONTROL 


° KR, POSTURAL 


side KRY VISUAL 








ALTERNATION 
——* A, ALL LEFT TILT 
watts on A, “ALTERNATELY LEFT 
ANDO RIGHT 


DEGREES ERROR 











7 8 8 13 16 17 
MEASURES 


19 21 23 25 


Fic. 2. 
nation 
vertical. 


Main effects of KR and Alter- 
upon perception of the postural 


RICHARD G, PEARSON AND GEORGE T. HAUTY 


KR, S's delay at the true vertical was 10 sec. 
Thereafter, E proceeded in accordance with 
the KR condition to which S was assigned. 
All 24 trials were completed in one session. 

Subjects.—The Ss were obtained from a 
population of volunteer, male, basic Air 
Force trainees. Four were tested per day 
until a sample of 96 was attained. 


RESULTS 


Consideration was first given to the 
absolute magnitude of error in pro- 
prioceptive perception of the vertical. 
Figure 2 plots the average errors for 
the KR and Alternation main effects. 
One is immediately impressed with 
the reduction in error that occurs for 
all conditions as testing progresses. 
Considering first the upper half of 
Fig. 2 one sees the beneficial effects 
of KR upon performance. Postural 


KR _ initially has a greater positive 
effect than visual KR, but eventually 
(about Trial 8) the visual subgroup 
attains the same performance level 
as the postural subgroup. 


Turning 
now to the lower half of Fig. 2, it is 
to be noted that the alternation (A») 
subgroup has a marked early reduc- 
tion in error as compared with the 
control (A,;) subgroup, and, in general, 
maintains this superior performance 
throughout testing. The above find- 
ings are supported by the statistical 
results, shown in Table 1, based upon 
a repeated measurements analysis 
made withing the split-plot design 
of KR (3) X Alternation (2) X Meas- 
ures (24). It is to be noted that all 
main effects and their first-order 
interactions are statistically significant. 

In Fig. 3, the principle course of 
these significant interactions is clearly 
resolved. Under the A; conditions, 
the no-KR group evidences a near 
linear reduction in error whereas the 
KR groups produce the curves of a 
quadratic function. Under the A, 
condition, on the other hand, the 
no-KR group performs quite differ- 
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TABLE 1 


ANALYSIS OF VARIANCE OF ABSOLUTE ERRORS 


Source 





KR 
Alternation (A) 
KR XA 


2 781.73 10. 4g*** 
1 | 634.94; 8.52** 
2 | 322.92 | 4.33* 
Error (a) 90} 74.52 


Measures (M) 23 | 177. 99 | | 39.12*** 


M X KR 6.5 
M XA 3) 10. 
M X KR XA 4. 

4. 


Error (b) |. 070 | 


Total 2, 30: 3 | 


*P < OS. 
oP < 01. 
“P< O01. 


ently. Its curve is scarcely distin- 
guishable from the quadratic curves of 
the KR groups. 

It should also be mentioned that 
two additional repeated measure- 
ments analyses were performed upon 
the average errors. In one, only the 
data from the odd-numbered trials 
were analyzed; here there was a 
comparison of measures made only 
at the left-tilt position and equated 
in time. In the second analysis the 
data from the first 12 trials for the 
A, condition were compared with 
that from the odd-numbered trials 
for the Az condition ; here we equated 
for the number of successive measures 
made at left tilt. Suffice it to say 
that the results of these analyses 
were identical with the previously 
described one, namely, that all main 
effects and their first-order interac- 
tions were statistically significant. 

Finally, the authors gave considera- 
tion to the directional aspects of 
error in perception of the vertical. 
The results of plotting the constant 
errors for the odd-gumbered trials 
were almost identical, in terms of 
slope and of the relative position of 
the curves, with those from the plot- 
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ting of the absolute errors. Also, the 
constant errors for the even-num- 
bered trials, right-tilt data only, were 
plotted. Here, in the case of both 
the Alternation main effects and the 
interaction effects (KR,A»s, KRoeAo, 
and KR;A,), the performance curves 
were essentially of the same slope 
as their “‘left-tilt’ counterparts. Anal- 
yses similar to those performed on the 
average error data were performed 
on the constant error data. Without 
going into detail it can be said that 
the statistical results supported the 
graphic findings, and, in general, 
confirmed the results obtained with 
average errors. 


DISCUSSION 
It has been shown that Ss, with 
repeated testing, will demonstrate a 
significant reduction of error in percep- 
tion of the vertical. Thisis even true for 
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Ss who are tilted repeatedly in the same 
direction and who receive no knowledge 
of their performance. To the authors 
this evidence suggests that the early 
effect of sensory adaptation to tilt is 
to a degree overcome, as testing pro- 
gresses, by proprioceptive learning.' 

It was further found that when Ss are 
tilted repeatedly in one direction, both 
postural and visual KR served to bring 
about improved performance as testing 
progressed. Inasmuch as returning S 
to the true vertical following each trial 
judgment does have a positive effect 
upon performance, it is suggested that 
careful consideration be given to the 
inclusion of postural KR in the design 
of similar studies. 

Most noteworthy were the findings 
with respect to the effects of ‘‘alterna- 
tion” upon performance. It was shown 
that the performance of a group of Ss 
tilted alternately left and right was 
significantly superior to that of a group 
tilted repeatedly to the left. Further- 
more, the effect of alternation was such 
that neither visual nor proprioceptive 
KR could contribute anything addi- 


tionally to the reduction in error pro- 


vided by alternation alone. In other 
words, under these conditions, alterna- 
tion was the main variable contributing 
to the reduction of perceptual error. 
To explain these findings one must note 
that on all even-numbered trials the 
A» (alternation) subgroup has received 
different sensory stimulation than the A, 


'Solley (Personal communication, Janu- 
ary, 1959) may have anticipated us here in 
the case of what appears to be a kind of 
spontaneous perceptual improvement. He 
reports “. . . a striking amount of learning, 
though there was no knowledge of results” 
in a group of Ss tilted alternately left and 
right. On the basis of the results of our 
present study we feel that the mere fact of 
alternation itself may have accounted for 
his results. Until a determination can be 
made as to which variable, sensory adaptation 
or proprioceptive learning, is responsible 
for the improvement observed in our study, 
we must regard the question of whether 
perceptual learning can occur without specific 
knowledge of results as merely an hypothesis. 
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subgroup. In Fig. 2 note especially 
the magnitude of the initial divergencv 
between the subgroups at Trial 2 where, 
it is to be remembered, the only differ- 
ence between them here is the direction 
of tilt. It would seem that in the case 
of the A» subgroup some of the sensory 
adaptation occurring at Trial 1 has been 
carried over to Trial 2, and thus has 
partially mitigated, or competed with, 
the sensory adaptation occurring at right 
tilt. Similarly it might be argued that 
at Trial 3 adaptation to left tilt for the 
A, subgroup has been partially mitigated 
by the previous adaptation to right tilt. 
Yet, another explanation seems to be 
equally plausible and is inherent in the 
fact that the A» subgroup on even- 
numbered trials receives “information” 
over and above that received by the A, 
subgroup; in other words, the Az sub- 
group, in passing through the vertical, 
enjoys a unique experience in that it 
learns what it is like to be on the other 
side of true vertical. This additional 
information may be expected to modify 
S's subsequent judgments. Of course 
the possibility should not be overlooked 
that both experiential factors and com- 
petition between sensory events may 
jointly contribute to S’s increased effici- 
ency in perception of verticality as test- 
ing progresses. In any case the above 
findings suggest that the practice of 
moving S to the opposite quadrant before 
moving him to the trial position (Mann, 
Berthelot-Berry, & Dauterive, 1949; 
Solley, 1956) cannot be considered just 
a precaution against the possibility that 
S would count himself back to the true 
vertical, but must be considered a sig- 
nificant part of a study’s methodology 
which therefore can play an important 
role in determining performance. 

A final point carries another implica- 
tion for studies of spatial orientation. 
Solley’s conclusion that “. . . use of 
so-called ‘random orders’ of presentation 
of conditions in kindred perceptual exper- 
iments should be looked upon with 
suspicion” (Solley, 1956, p. 332) is well 
made. In the light of our findings with 
regard to proprioceptive learning it is 
further apparent that the practice of 
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averaging the results of several consecu- 
tive trials made under certain conditions 
should also be looked upon with sus- 
picion. .For example, under conditions 
where Ss are tested in both right and 
left quadrants, it may well be that 
with repeated testing the effects of such 
variables as delay at tilt and speed of 
chair movement would cease to have a 
significant effect upon performance. It 
is noteworthy that the performance 
curves for the Alternation variable of the 
present study have approximately the 
same slope as those from Solley’s experi- 
ment although there are several basic 
differences in design methodology be- 
tween the two studies. 


SUMMARY 


Ninety-six Ss were tested on a lateral-tilt 
chair under darkroom conditions. The task 
consisted of 24 trials and required S to return 
himself from a position of 30° tilt to the per- 
ceived vertical. It was found that: (a) Ss 


with repeated testing demonstrated a reduc- 
tion of error in perception of the vertical, 
probably as a result of proprioceptive learn- 
ing; (b) return to the true vertical following 


each trial judgment (postural knowledge of 
results) and visual representation of error 
(visual knowledge of results) both had a 
significant positive effect upon reduction of 
error; (c) Ss tilted alternately left and right 
performed significantly superior to Ss tilted 
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repeatedly to the left. In the case of the 
alternation Ss it was further found that 
neither visual nor proprioceptive knowledge 
of results were contributing anything addi- 
tionally to reduction of perceptual error than 
that provided by alternation alone. An 
attempt was made to explain this finding 
in terms of experiential factors and competi- 
tion between sensory events resulting, in the 
case of the alternation group, from the mere 
process of passing through the true vertical. 

Finally, implications of the findings for 
the design of spatial orientation studies were 
discussed. 
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In the two experiments reported in 
this paper, the effects of syllable famil- 
iarization on three separate phenom- 
ena were investigated. These were: 
(a) syllable association value, (6) 
speed of learning and the nature of 
error production in a serial rote learn- 
ing task, and (c) reminiscence. 

The first two phenomena were 
examined to test hypotheses concern- 
ing the reason familiarization reduces 
trials to learn. In their paper, Hov- 
land and Kurtz (1952) suggested 
three different ways by which the 
effects of familiarization could influ- 
ence learning speed. These three 


ways are: (a) by increased differentia- 
tion among the items within a list 


(cf. Gibson, 1940); (6) by increased 
differentiation between familiarized 
and unfamiliarized items; (c) by a 
reduction in the pool of alternative 
responses available to be anticipated. 
A proposal similar to (c) has also 
been made by Noble (1955). Each 
of these hypotheses has implications 
for the nature of errors made in 
learning familiarized items. 

An additional hypothesis not con- 
sidered by Hovland and Kurtz is that 
familiarization produces its effect 
through increasing the availability of 
meaningful mediators. The media- 
tors in turn would facilitate the estab- 
lishment of linkages between the 
syllables in a series. Specifically, this 
hypothesis would assert that famil- 
iarization would speed learning by 


' This study was supported by a grant from 
the National Science Foundation. 


increasing the association value of 
familiarized syllables. Although the 
effect of association frequency (Noble's 
m) on learning has been well-docu- 
mented by Noble (1952a, 1952b) and 
by Mandler and Huttenlocher (1956), 
surprisingly little is known concerning 
the experimental variables that can 
produce changes in association value. 
It is likely that if words are paired 
with syllables in a familiarization 
procedure, the association value of 
the syllables will increase. It is 
less obvious but possible that sheer 
familiarization might also influence 
association value. By increasing the 
strength of the associative connections 
between the letters in a_ syllable, 
unitary responses to these letter 
combinations may develop. These 
responses may in turn elicit meaning- 
ful associates that act as mediators. 
Conversely, it is also possible that 
one of the ways by which a subject 
recalls the letters that form syllables 
is through an active search for mean- 
ingful associates that aid in the recall 
of all the letters in a syllable. Either 
process could affect association value. 
Regardless of the mechanism, how- 
ever, increase in association value is 
clearly a possible result of familiariza- 
tion. That changes in association 
value of syllables may occur without 
explicit pairing of syllables with 
meaningful items was demonstrated 
by Hull (1933). He showed that a 
few trials of rote learning increases 
the number of nonsense syllables that 
elicit meaningful associates. There- 
fore, in view of the known relation- 
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ships between association value and 
learning speed on one hand, and 
between familiarization and learning 
speed on the other, the relationship 
between familiarization and associa- 
tion value is of systematic interest, 
both as a problem in its own right 
and because of its bearing on the 
familiarization-learning problem. 

The other problem that was in- 
vestigated was the effect of the 
familiarization procedure on perform- 
ance following a rest interval at the 
point of approximately 50% learning. 
The procedure followed was a com- 
bination of those used by Ward (1937) 
and by Hovland (1938). Meaning- 
fulness (cf. Melton & Stone, 1942) 
and more directly, interlist and intra- 
list interference as investigated by 
Underwood and Richardson (1955) 
and by Riley (1957), have occupied 
central positions in the analysis of 
distribution of practice and reminis- 
cence effects. These studies suggest 
that the extent to which a rest interval 
benefits further performance depends 
on the degree of interference prior to 
the rest. If familiarization influences 
error occurrence prior to a rest inter- 


val, it should also exert a correspond- 


ing effect on postrest performance. 


METHOD 
Materials 


In both Exp. I and Exp. II, the experi- 
mental materials consisted of six practice 
lists and four critical lists. The practice 
lists were used to train Ss in the technique 
of serial learning. The critical lists were 
used to test the hypotheses discussed in the 
introduction. 

The practice lists were 12-item serial lists 
constructed from Glaze’s (1928) syllables. 
The lists averaged between 25% and 30% 
association value. The syllables in each list 
were selected so as to minimize intralist 
similarity. Within each list, all syllables 
had different first letters; all had different 
last letters. No vowel appeared more than 
three times in any list. No syllable appeared 
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in more than one list. The same practice 
lists were used in both experiments. 

In Exp. I each of the four critical lists was 
a 12-item serial list. The syllables were 
from Glaze’s lists. These lists averaged 
between 19% and 22% association value. 
The rules of list construction were the same 
as for the practice sets. There was, however, 
one additional restriction: no syllable had 
the same first two letters or the same last two 
letters as any other syllable in any of the four 
critical lists. Between 16 and 18 consonants 
were used in the construction of each list. 
No critical list syllables were taken from the 
practice lists. 

In Exp. II the critical lists were constructed 
according to the same rules as the critical 
lists of Exp. I. These new lists differed 
from those of Exp. I in that each list had 15 
syllables rather than 12 syllables. This 
increase resulted in more repetition of con- 
sonants, but such repetition was held to a 
minimum. Between 17 and 20 consonants 
were used in the construction of each list. 


Experiment I 


For each of 20 paid male student Ss, the 
experiment extended over seven daily sessions 
of 1 hr. each. During Days 1-4, Ss memo- 
rized the six serial practice lists. In addition, 
Ss were given special familiarization training 
on the syllables of two of the four critical 
lists. On Day 5, Ss were given an association 
test on the items of all four critical lists. 
On Days 6 and 7, Ss learned the four critical 
lists. 

Practice sessions.—Each S learned the six 
practice lists by the serial anticipation method. 
One list was learned on each of the first two 
days; two lists were learned on Day 3, and 
two on Day 4. The Ss were required to spell 
out their anticipations and to correct errone- 
ous responses on seeing the correct response. 
The items appeared at a 2-sec. rate on a 
memory drum, with 6-sec. between trials 
Each S learned three of the lists with a 2-min. 
rest interval following the trial on which 
seven syllables were first correctly antici- 
pated. During the interval, Ss named colors 
that appeared at a 2-sec. rate on the memory 
drum. The other three lists were learned 
without the rest interval. Four orders of 
counterbalancing the learning of rest (R) 
lists and nonrest (NR) lists were used, of 
which R, NR, NR, R, NR, R is an example. 
Five Ss followed each schedule. 

Syllable familiarization.—During the four 
practice days, each S learned to recognize the 
24 syllables that comprised two of the four 
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critical lists. The choice of the particular 
lists on which each S received this familiari- 
zation training was determined by S’s random 
assignment to one of four rows in one of five 
greco-latin squares. 

Familiarization training consisted of show- 
ing S a series of 3 X 5 in. cards, each one of 
which had on it either the first two or the 
last two letters of a nonsense syllable. For 
each syllable there were two cards, one of 
each variety. As each card was presented, 
S was required to spell out all three letters 
of the syllable from seeing only two. After 
approximately 3 sec. the card was turned 
over and S was shown all three letters. At 
the end of a trial, those cards which had been 
correctly recognized were dropped from the 
pack, and the remaining cards were shown 
in the same manner on the next trial. This 
procedure was continued until all the cards 
had been correctly recognized. Then all 48 
cards were shown again in the same way. 
This procedure was continued until all 48 
syllables had been correctly completed on two 
successive errorless trials. The cards were 
shuffled between each exposure of the re- 
maining items in the pack. All Ss attained 
the level of proficiency just described prior 
to Day 5. At the beginning of Day 5, each 
S was again required to recognize all 48 cards 
in one trial, the same procedure being followed. 

Association test.—F ollowing the completion 
of familiarization, Ss were shown, in random 
order, a series of 48 cards, each with all three 
letters of a syllable on it. Twenty-four 
of the cards had the familiarized items, the 
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others had the unfamiliar syllables from the 
other two critical lists. As each card was 
presented, S was required to respond as 
rapidly as possible with as many words as the 
syllable reminded him of, for a period of 30 
sec. He was instructed to give only words 
suggested by the syllable and not to give 
chains of associations (Noble, 1952a). The 
E recorded both the time to the first response 
and the responses given. 

Critical lists —On Days 6 and 7, S learned 
the four critical lists. The procedure was the 
same as for the practice lists. One of the 
lists containing familiarized syllables and 
one of the lists containing nonfamiliarized 
syllables were learned with a 2-min. rest 
following the trial on which seven syllables 
were first correctly anticipated. The other 
“familiar” list and the other “unfamiliar” 
list were learned with the regular 6-sec. break 
at that point. These four conditions are 
hereafter designated NR-F (nonrest, famil- 
iar), R-F (rest, familiar), NR-U (nonrest, 
unfamiliar), and R-U (rest, unfamiliar). 
Two lists were learned each day. The order 
in which each S learned the four critical lists 
was determined by the same set of greco- 
latin squares that determined the pairings 
of Ss and lists. In each, the rows were Ss, 
the columns were the order in which the 
lists were learned, the greek letters were lists, 
and the roman letters were conditions. To 
illustrate the assignment of Ss to conditions, 
one of the squares is presented in Table 1. 


Experiment II 


Except for two changes in procedure, 


Exp. II was a repetition of Exp. I. The two 
changes were that the association test was 
given after the learning of the four critical 
lists rather than before, as in Exp. I, and the 
length of the critical lists was extended to 15 
items. In all other details, the two experi- 
ments were the same. 


RESULTS 


Practice lists.—Because a detailed 
analysis of the results of the practice 
list performances in the two experi- 
ments did not shed light on the 
hypotheses under consideration, the 
analyses will not be presented. The 
only significant difference between 
the R and NR conditions was in the 
number of correct anticipations on 
the trial following the criterion of 
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TABLE 2 


Exp. I 





Median Latency 
of First Response 
(Sec.) 








5.19 
5.06 





seven correct. Although both condi- 
tions exhibited a drop in performance, 
the drop was significantly greater 
in the R than in the NR condition. 

Association tests —The association 
tests for both experiments are sum- 
marized in Table 2. For each S two 
median latency scores were obtained : 
one F score and one U score. The 
means of these medians are shown 
in the left column of the ‘table. 
Similarly, the mean number of re- 
sponses in 30 sec. to F syllables and to 
U syllables was determined for each 
S. The means of these values are 
shown in the right column of Table 
2. The values of the ¢ tests at the 
bottom of the table indicate that 
familiarization had no ascertainable 
differential effect on either measure 
of association. 

Experiment I critical lists.—Table 3 
shows the mean number of trials Ss 
used to achieve 7 correct anticipations 
in one trial, the mean number of 
trials to 12 correct anticipations, and 
the mean number of trials from 7 
correct to 12 correct. Three analyses 
of variance were run; one on each of 
the sets of scores summarized here. 
The only significant main effect was 
the effect of familiarization prior to 
7 syllables correct (F = 6.62; df = 1 
and 19; .01 << P < .05). Neither the 


2.64 
2.56 


AO \oyw = 1.33;t = 1.04 


Median Latency 
of First Response 
| 


Mean 


4.08 
4.00 


| men | 90 


3.58 
1.14 


3.42 
3 


oy = .34;t = 1.26 


| 
| 
| 
effects of the rest interval nor the 
interaction between rest and famil- 
iarization approached statistical sig- 
nificance. Examination of the SDs 
in Table 3 suggests heterogeneity of 
variance, but analysis of log trans- 
formations of these data did not alter 
conclusions reached from the present 
analysis.? Neither did the rest pro- 
duce any significant effect on the 
change in performance from the 
criterial trial of seven correct to the 
next trial. These means are shown in 
Table 4. The Fs for the effect of rest 
and the main interaction were both 
less than 1. The F for Familiariza- 


TABLE 3 


MEANS AND SDs oF TRIALS TO INDICATED 
SYLLABLE CRITERIA IN CRITICAL Lists: 
Exp. I 

| 


| Trials from 7 
to 12 Correct 


Trials to7 | 
Correct 
Cond. | 


Trials to 12 
Correct 


| Mean | SD | Mean| SD 


— — — — -_ = | = = 
NRU | 7.50 | 3.88 | 20.45| 9.27] 12.95 | 8.07 
RU | 6.90 243| 1490 10.02 | 11.50 | 7.50 
| 


| Mean SD 


NRF | 5.30 | 2.43 | 14.90 | 6.46) 9.60) 5.48 
RF | 6.40 15.60; 4.48) 9.20) 2.25 


1.86 
wa 


2A search of the literature indicates that 
procedure for testing for homogeneity of 
variance in correlated samples has not yet 
been developed. 
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TABLE 4 


SYLLABLES Correct ON 7/12 CrITERION 
TRIAL AND ON THE NEXT TRIAL IN 
Criticay Lists: Exp. | 


Trial Trial 
| /12 7/12+1 


Cond. 


| 
| | 
| Mean 


i | 7.80 | 
7:55 | ‘81 


93 | 7.55 | | 


| 7, 
7.80 | . | 7. 
r | 7.55 | 6 


| 


tion was 2.32 (df =1 and 19). 
Despite this general lack of signifi- 
cance in the main results, two major 
classes of overt errors were signifi- 
cantly influenced by the familiariza- 
tion procedure. The mean number 
of intralist errors per trial increased 
from .37 (U) to .72 (F), (P < .01). 
The mean number of all other overt 
errors per trial decreased from .88 
(U) to 46 (F), (P < .01). Since 
the major effects were more reliable 
in Exp. II, a detailed analysis of the 
errors will be deferred until later. 
Experiment II critical lists —There 
were two reasons why Exp. II was 
run with the modifications listed in 
the method section. First, the effect 
of familiarization on learning speed 
was very unreliable. The failure to 
find a larger effect might have been 
because the U items became some- 


TABLE 5 
TRIALS TO INDICATED SYLLABLE CRITERIA ON 
Critica Lists: Exp. Il 


Trials to 15 | Trials from 7 
Correct to 15 Correct 


Trials to 7 
Correct 


3.05 | 28.20 
3.32 | 21.55 
2.38 | 22.75 


8.25 
6.40 
6.05 





what familiar during the association 
test. Consequently, the association 
test was placed after the learning of 
the critical lists. The purpose of the 
increase in the length of the critical 
lists from 12 to 15 syllables was to 
increase the difficulty of the lists and 
thereby increase the possibility that 
some effect of the rest would appear. 

The number of trials to criteria 
of 7 correct, 15 correct, and from 7 
correct to 15 correct are shown in 
Table 5. Familiarization significantly 
reduced trials to learn in all three cases 
(trials to 15 correct, F = 11.38, df = 1 
and 19, P < .01; trials to 7 correct, 
F = 16.11, df = 1 and 19, P < .01; 
trials from 7 correct, F = 6.51, df = 1 
and 19, .01 <P < .05). No other 
experimental manipulation was sta- 
tistically significant. 

Table 6 shows the mean correct 
responses on the trial on which seven 
syllables were first correctly antici- 
pated and on the next trial. In the 
analysis of variance on the change in 
the number of syllables correct, both 
the effect of rest and the effect of 
familiarization resulted in F's of less 
than 1. However, the interaction 
of familiarization and rest was signifi- 
cant at the 2% level (F = 7.54, 
df=1 and 19). This significant 
interaction reflects a beneficial effect 
of the rest interval that appears in the 


TABLE 6 


SYLLABLES CorRrECT ON 7/12 CRITERION 
TRIAL AND ON THE NEXT TRIAL IN 
Criticav Lists: Exp. II 


Cond. 





NRU 
RU 
NRF 
RF 
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TABLE 7 


CLASSIFICATION OF MEAN RESPONSES PER TRIAL ON CrITICAL Lists INTO Four 
Major GroupinGs: Exp. II 


Omissions 


Intralist Errors | 


Other Overt Errors 





Mean 


SD 


| 
| 





9.53 
9.41 
9.21 
9.44 





Postcrite 





F conditions but does not appear in 
the U conditions. It should be 
noted that the same _ interaction 
appears in the mean differences in 
Exp. 1, but there it is not statistically 
significant. 

The analysis of errors is shown in 
Table 7. In order to give a complete 
picture of the effect of familiarization 
on the nature of the responses, the 
number of correct responses per trial 
is also included in the table. The 
number of correct responses per trial 
is not significantly influenced by 
familiarization (F = 2.91; df = 1 and 
19; P > .05). Neither is the number 
of omissions nor, of course, the com- 
plement of omissions, the total num- 
ber of responses per trial (F = .24, 
df =1 and 19). The main effects 
of the familiarization are to increase 
intralist errors per trial (F = 19.36; 
df=1 and 19; P <.01) and to 
decrease other overt errors per trial 
(F = 28.14; df = 1 and 19; P < .01) 
by roughly corresponding amounts. 
Table 8 shows a breakdown of these 
“other” errors per trial into the 


Precriterion 7/ 


rion 7/15 








various subclasses. We have not 
attempted to analyze the significance 
of the differences here because of the 
extremely small values in some cases. 
However, the major sources of “other” 
errors are obvious: viz., inventions 
(made-up syllables not in any practice 
or critical list) and partial responses. 
It is equally clear that the reduction 
in “other” errors as a result of 
familiarization is directly attributable 
to a reduction in these two types of 
errors. 

To summarize, the effect of famil- 
iarization is two-fold. First reminis- 
cence is produced on the first postrest 


TABLE 8 


CLASSIFICATION OF MEAN “OrHER" Errors 
Per TrIAt in Critica Lists: Exp. II 


Interlist Intrusions 


Partial 
Responses 


Inven- 
tions 


From From 
F Lists | U Lists 


o2 | 
06 | 


From 
P Lists 


01 | .04 
01 | .02 
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trial. This beneficial effect is tran- 
sient, and does not appear in trials 
to criterion. Second, familiarization 
reduces the number of erroneous 
combinations of letters and incom- 
plete responses. In their place, intra- 
list errors tend to occur. While 
Hovland and Kurtz (1952) found the 
same reduction in “other” errors, 
they did not find an increase in 
intralist errors. Perhaps the most 
important reason for this discrepancy 
is that they analyzed mean total 
errors, whereas we have analyzed 
mean errors per trial in order to 
correct for differences in number of 
trials. 


DISCUSSION 


The evidence from the present experi- 
ments shows that familiarization of 
syllables does not affect either their 
association latency or their association 
frequency. Apparently, the familiariza- 
tion effect as shown by increased learning 
speed in the familiarized conditions 
does not depend on the increased availa- 
bility of meaningful associates acting as 
mediators. 

To account for the facilitative effect 
of familiarization in the present experi- 
ments, the following facts, as shown in 
the analysis of Exp. II must be considered. 
Following familiarization, total response 
rate remains relatively unchanged, intra- 
list errors per trial increase, and inven- 
tions and partial responses per trial 
decrease. 

One possible way to account for these 
facts is to assume that the response 
rate is fixed by factors such as instruc- 
tions, the sophistication of the S in the 
situation, and properties of the S such 
as willingness to guess. If familiariza- 
tion does not influence response rate, 
then its main effect, aside from speeding 
learning, is to bring about a redistribu- 
tion of errors. In the U condition, Ss 
make many partial errors and inven- 
tions. During learning, they simultane- 
ously learn what letters go together to 
form the syllables and where syllables 
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go. Hence there is no great build-up 
of intralist errors. On the other hand, 
in the F condition, Ss, on coming into 
the serial learning situation, have already 
learned the letter sequences. Learning 
is consequently more rapid because only 
the position of the items must be learned. 
The fact that the letter sequences have 
already been learned reduces the tend- 
ency to make partial errors and inven- 
tions. Given a constant response rate, 
the inevitable result is a rise in intralist 
errors. It is inevitable because the 
familiarized items are strong but their 
location has not been learned. Accord- 
ing to this view, the rise in intralist 
errors is merely a by-product of the same 
procedure that reduced trials to learn. 

This interpretation is most similar 
to Hovland and Kurtz’ suggestion that 
the effect of familiarization is to supply 
S with a limited pool of responses from 
which anticipations are to be made. 
The increase in intralist errors in the 
familiarized conditions argues against 
the notion that the effect of familiariza- 
tion is to reduce intralist generalization. 
The small number of interlist errors 
in both the familiarized and the non- 
familiarized conditions shows that dis- 
crimination between the two types of 
items, as measured by interlist intrusions, 
is at least not a necessary condition for 
the facilitative effect of familiarization. 

The presence of reminiscence in the 
familiarized conditions of Exp. II points 
to the condition that may be necessary 
for producing this highly elusive phe- 
nomenon, namely, a high degree of inter- 
ference between items. The problems 
that this sort of finding raises for a 
general interpretation of distribution 
effects have been discussed by Riley 
(1957) in another paper. 


SUMMARY 


Two experiments were conducted that 
evaluated the effects of nonsense syllable 
familiarization on: (a) the association value 
of the syllables, (b) the speed of learning 
serial lists, and (c) performance following a 
2-min. color-naming interval after seven 
correct anticipations. Trained Ss were run 
under all conditions. 
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Although familiarization had no effect on 
association value, it did facilitate learning. 
Familiarization also produced both reminis- 
cence and a marked increase in intraserial 
errors. 
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THE EFFECTS OF MOTIVATION AND PROBABILITY OF 
REWARD ON TWO-CHOICE LEARNING! 


PAUL J. WOODS 
Hollins College 


In the present experiment three 
variables are investigated in relation 
to a two-choice task with contingent 
reward. The variables are motiva- 
tion, in the form of an aversive audi- 
tory stimulus, correction vs. non- 
correction procedure, and probability 
of reward. The results will be inter- 
preted in terms of Estes’ statistical 
learning theory (Estes, 1950, 1954; 
Estes & Straughan, 1954). 


METHOD 


Design.—The schedules of probability of 
reward were 98:58, 87:47, and 53:13,? 
which respectively predict the following 
asymptotes in Estes’ theory: .95, .80, and 
.65. The notation used for the reward 
schedules indicates the percentage of times 
reward followed each alternative, and the 
notation used for the asymptotes refers to 
the proportion of times the theory predicts a 
group of Ss will ultimately choose the alter- 
native with the higher probability of reward 
(w,;). The event probabilities were ran- 
1The present paper is adapted from a 
dissertation submitted in partial fulfillment 
of the requirements for the degree of Doctor 
of Philosophy at the University of Connecti- 
cut. The writer wishes to express apprecia- 
tion to his advisor, Howard Brand, for advice 
throughout the course of the research. In- 
debtedness to James M. Sakoda and David 
Zeaman for their many helpful suggestions 
is also acknowledged. 

This research was supported, in part, by 
the National Institute of Mental Health 
under Grant M-814. 

? The reader should note that the difference 
between the probabilities of reward is con- 
stant, but the ratios are not so controlled. 
Brand, Woods, and Sakoda (1956) have 
found that with a constant ratio, increasing 
the difference produced increasing asymptotic 
levels, but with a constant difference, increas- 
ing the ratios produced no consistent influ- 
ence on the asymptotic levels. 


domized separately for each S within blocks 
of 20 trials, so successive blocks contained the 
approximate proportions of the particular 


schedule. For the entire series of trials, 
however, the probability schedule was 
attained. 


Each schedule was used under a correction 
and a noncorrection procedure and three 
conditions of motivation. The motivation 
was varied by means of the intensity of a 
noxious auditory stimulus produced by a 
noise generator. The low intensity noise, as 
measured by a sound pressure level meter, was 
70 + 3 db, and the high intensity noise was 
95 +3db. The medium intensity was chosen 
at 80 + 3 db on the basis of data from five Ss 
whoestimated this level to be halfway between 
the low and high intensity noises. Conse- 


TABLE 1 


ANALYSIS OF VARIANCE OF THE PROPORTIONS 
or CHOICE TO THE MoRE REWARDED 
Sipe 1n Biocks or 20 TRIALS 








Source | df | MS F 
Main-plot analysis 

Proceduref (P) 1 | 0.034! 0.2 
Pxs | 5 |0033| 02 
Motivation (M) | 2/0152) 1.0 

x 2|0.180| 1.2 
SxXM 4 |0.135) 0.9 
PxS*™M | 4/0050) 0.3 
Error | 90 | 0.156 
Total | 107 

~~ analysis | 

eee | legal ag 
BXS .28 | 0.050 4.8* 
BXM | 28/0014) 1.4 
BXPxS 28 |0.010; 1.0 
BXPXM 28 | 0.016! 1.5 
BXSXM 56 |0.010| 1.0 
BXPXSXKM 56 (0.014) 1.3 
Error | 1260 | 0.011 
Over-all Total 1619 | 








*P < 001. 
t Correction vs. noncorrection. 
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ad aa 
GLOCKS OF 20 TRIALS 
Fic. 1. The mean and median propor- 


tions of choices to the high x alternative for 
each schedule (pooled over conditions of 
motivation and procedure). 


quently the physical stimuli were at approxi- 
mately equal psychological intervals. Inten- 
sities greater than 95 db were not used for fear 
of damaging Ss’ ears. 

All possible combinations of the three 
independent variables gives 18 experimental 
conditions. Six Ss were assigned randomly 
to each condition, and they were given 300 
consecutive acquisition trials. 

A pparatus.—A two-choice contingent rein- 
forcement apparatus was used which has 
been fully described elsewhere (Brand, 
Sakoda, & Woods, 1957). The only modifica- 
tion was the addition of a noise generator 
whose output was fed into an ordinary ampli- 
fier and delivered to S through a headset. 

Subjects.—The Ss were 108 undergraduate, 
unpaid volunteers at the University of 
Connecticut. There were 74 women and 34 
men. The Ss were asked if they had any 
auditory impairment; none reported any. 

Procedure.—Details are available else- 
where (Woods, 1957). The task presented 
S was that of trying to outguess E on each 
trial by always pressing the switch he thought 
would give the red light rather than the white 
light. Then E set the electrical connections 
for one of the probability schedules, and S 
was given the green ready signal and the 
aversive stimulus. For the noncorrection 
procedure, if S chose incorrectly, a white 
light flashed and the noise continued. 
For the correction procedure, if S chose 
incorrectly, he was required to make the 
opposite response. The correction response 
was followed by a red light and termination 
of the noise, as were correct responses in 
either procedure. The interval between 
trials was about 2 sec.* 


2A criticism of this procedure might be 
that S is responding only to the instructional 
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BLOCKS OF 20 TRIALS 
Fic. 2. The mean proportions of choices 


to the high « alternative for the correction 
and noncorrection procedure (pooled over 
schedules and conditions of motivation). 


RESULTS 


Over-all analysis.—A split-plot anal- 
ysis of variance was performed on the 
proportions of responses to the high 
mx alternative in blocks of 20 trials 
and is summarized in Table 1. 
First the main effects will be discussed 
and then the higher-order interactions 
will be given. 

Schedules of probability of reward. 
Figure 1 shows a plot of the mean 
proportions of responses to the high 
me alternative in blocks of 20 trials 
for each schedule. The differential 
effect of the schedule of reward 
probabilities is so strong that only 
in the early learning trials do two 
of the curves intersect. 

Procedure (correction and noncor- 
rection).—The mean proportions of 
choices to the most rewarded alterna- 
tive for the correction and noncor- 
rection procedure are shown in Fig. 
2. There is no significant difference 


task of trying to get red lights, and hence 
it is not certain that the noise alone is capable 
of influencing behavior. To answer this 
criticism a small study was conducted with 
a single schedule of reinforcement (87:47), 
nine Ss being run under each noise intensity 
without 


any reinforcement lights. All Ss 
learned to prefer the noise terminating 
alternative and the mean curve did not 


differ significantly from the 87:47 curve of 
the main experiment. 
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OLOCKS OF 20 TRIALS 


Fic. 3. The mean proportions of choices 
to the high « alternative for conditions of 
motivation (pooled over schedules and 
procedure). 


although for 10 out of the 15 points, 
the correction curve is above the 
noncorrection curve. 

Motivation.—The mean proportions 
of responses for each of three levels 
of motivation are plotted in Fig. 3. 
The curve for the intermediate level 
of motivation is initially below those 
for the low and high conditions. 
Such a relationship suggests a non- 
linear trend between motivation and 
the rate of learning. No significant 
difference was found for these condi- 
tions, however, in the analysis of 
variance which tested for mean level 
differences or in the analysis by 
orthogonal polynomials (Grant, 1956) 
which tested for differences in trend. 

To investigate further the possi- 
bility of a nonlinear trend between 
motivation and the rate of learning, 
one can use the rate constants from 
Estes’ prediction equation (Brand 
et al., 1957) as a measure of the rate 
of learning. The values based on 
empirical asymptotes for each of the 
18 conditions are shown in Table 2. 
It may be noted that the relation 
between the rate constant and mo- 
tivation is U shaped in five of the six 
instances. 

Higher-order interactions.—In the 
over-all analysis of variance the 
Blocks X Procedure X Motivation 





' (P = 06). 
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interaction approaches significance 
This interaction shows 
up if the two procedures are plotted 
separately for each motivating condi- 
tion (Fig. 4). The curves for the two 
procedures reverse their relationship 
in the later stages of learning as 
motivation increases. 

The quadruple interaction also ap- 
proaches significance in the over-all 
analysis of variance (P = .07). 

Asymptotic levels—The theory pre- 
dicts asymptotes by means of the 
following formula which has been 
independently derived by Estes (1954) 
and Bush and Mosteller (1955, p. 
288) : 

1 — We 
2- Ti — Fe 


where m; and m2 are the probabilities 
of reward on the more rewarded and 
the less rewarded alternative, re- 
spectively. As shown in Fig. 1 the 
lower curve (53:13) rises consider- 
ably above the predicted asymptote ; 
the middle curve (87:47) is just 
slightly above it; and the upper curve 
(98:58) is a bit below it. It was 
felt that by plotting frequency dis- 
tributions of individual asymptotic 
estimates under each schedule a 
better picture of these relationships 
might be obtained. A_ technique 
that has been used for estimating 


TABLE 2 


Rate Constants (6(2 — 2, — 2)) 




















‘ Motivation 

a a Procedure 
Low Middle High 
98:58 Cor. 018 015 .026 
Noncor. 024 011 021 
87:47 Cor. .036 013 019 
Noncor. 014 015 O17 
53:13 Cor. 015 012 013 
Noncor. .016 .010 016 
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individual asymptotes has been to 
average the same final number of 
trial blocks for each S. It was felt 
that a more valid estimate could be 
obtained, however, if the number 
of trial blocks to be averaged in this 
estimate were determined by an 
inspection of the individual records. 
Accordingly this technique was used 
and the number of trial blocks 
averaged fell within the range of 5 to 
11 for over 80% of the Ss. For those 
records which did not appear to be 
leveling off, the “asymptote” was 
computed by averaging the final two 
or three blocks. 

The distributions were markedly 
skewed in all three cases. The ¢ 
tests comparing the means of the 
individual values with the predicted 
values gave a significant discrepancy 
for the 53:13 schedule (P < .01). An 
analysis of variance of these individual 
asymptotic estimates indicated that 
the schedules produced the only 
significant differences (P < .0005). 
A point of importance here is that 
motivation in this task does not 
markedly raise the asymptotic level. 

To obtain a more sensitive analysis 
of the relationship in the later parts 
of the records, split-plot analyses 
of variance were carried out on the 
final eight trial blocks under each 
motivating condition. The only find- 
ing of interest is that under low 
motivation, trials are not significantly 
different; they are highly significant 
under middle motivation (P < .005) ; 
and they are not significant under 
high motivation. There is, there- 
fore, a U-shaped relationship between 
motivation and the stage of learning 
at which a stable final response level 
is achieved. This finding was re- 
flected in the rate constants shown 
in Table 2. 
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BLOCKS OF 20 TRIALS 


Fic. 4. The mean proportions of choices 
to the high x alternative for the correction 
and noncorrection procedures under each 
condition of motivation (pooled over sched- 
ules). 


DIscuSSION 

Schedules—Increasing the over-all 
probability of reward, when the differ- 
ence between @ values for the two alter- 
natives is constant, produces significantly 
increasing proportions of choices to the 
more rewarded alternative: a result that 
has already been found in previous 
studies. Analysis of the data obtained 
here indicates that with schedules leading 
to high asymptotic levels we obtain very 
skewed distributions of empirical points 
in the later stages of learning. Conse- 
quently, mean curves can profitably be 
supplemented by additional data, such 
as the median curves plotted in Fig. 1, 
to give more information concerning the 
course of two-choice learning in groups 
of Ss. 


Procedure-—Two experimental pro 


cedures were used in this experiment to 
test an assumption in Estes’ model for 
the contingent reinforcement situation, 
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where noncorrection procedure is ordi- 
narily followed. The assumption in- 
volved in the application of the model to 
the noncorrection procedure is that when 
S is told he has made an incorrect re- 
sponse, he covertly makes the correct 
one. Specifically, there is an assumption 
in the derivation of the prediction equa- 
tion that a wrong response to one alter- 
native leads to conditioning the stimulus 
elements present to the opposite alterna- 
tive, and thereby increases the proba- 
bility that the opposite alternative will 
be chosen on subsequent trials. In the 
correction procedure, however, whenever 
S responds incorrectly, he immediately 
makes the alternative response overtly 
and gets rewarded. If the assumption in 
the noncorrection procedure is warranted, 
then there should be no difference in 
acquisition under the two procedures. 
No significant differences were found in 
the present experiment between the two 
procedures. It seems, therefore, that 
this assumption is tenable, at least when 
other variables which may possibly be 
producing interaction effects are pooled. 

The Trials X Procedure X Motiva- 
tion interaction approaches significance, 
however, and the results as plotted in 
Fig. 4 do merit consideration. In the 
final half of the records, the acquisition 
curves for the two procedures completely 
reverse their relative positions as motiva- 
tion increases. It is possible that real 
effects are being obscured here and else- 
where by the large amount of variation 
among Ss in the same experimental 
conditions. 

Motivation.—Motivation in the form 
of a noxious auditory stimulus had no 
unequivocal effect on the rate of learning. 
Several aspects of the data suggest a 
curvilinear relationship. Itisinteresting 
to note, however, that this U-shaped 
relationship is just the opposite of the 
inverted U-shaped relationship between 
motivation and efficiency which has been 
mentioned in many textbooks. (See, for 
example, Wickens & Meyer, 1955, p. 
137.) Of course, the noxious stimulus in 
the present experiment may have in- 
fluenced certain aspects of S’s behavior 
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which were not measured, such as the 
speed of responding. And further, sptc- 
ulating on an ad hoc basis, it may have 
stimulated an aspect of behavior which 
usually leads to greater efficiency, viz., 
ideas or ‘“‘hypotheses”’ about the problem 
at hand. This very stimulation in this 
sort of task does not necessarily lead to a 
greater proportion of choices to the high 
m alternative. It leads S to try various 
hypotheses, such as notions about the 
patterning of reinforcements, etc., and 
thereby increases the number of choices 
to the low w alternative. This may then 
be the reason for the U-shaped relation- 
ship suggested by the present experiment. 


SUMMARY 


The purpose of the experiment was to 
investigate the relationship between behavior 
in a two-choice learning task and three experi- 
mental conditions: motivation, in the form 
of an aversive auditory stimulus, correction 
and noncorrection procedure, and schedule 
of contingent reward. In all there were 
three levels of motivation, two experimental 
procedures, and three schedules of probability 
of reward. The acquisition data of the six 
Ss run under each of the conditions were 
compared and the results were further 
analyzed in the context of Estes’ statistical 
learning theory. The main effect was due 
to the probability of reward. The other 
variables enter into the process to a lesser 
degree. The data suggest the relationship 
between motivation and rate of learning 
may be U-shaped in nature. 
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AGE DIFFERENCES IN 


THE ACQUISITION AND 


EXTINCTION OF THE CONDITIONED 
EYELID RESPONSE ! 


HARRY W. BRAUN anp 
University 


This study was stimulated in part 
by a report of Gakkel and Zinina 
(1953) that Ss over 60 years of age 
required a longer period for condition- 
ing of the eyeblink and also for its 
extinction. It was also performed 
to provide systematic information 
on the conditioning and extinction 
performance of children, young adults, 
and old adults under the same con- 
trolled conditions. Apparently this 
problem has been hitherto unexplored. 


METHOD 


Subjects.—These were 15 boys in the age 
range 8-10 yr., with a mean age of 9.36 yr.; 
15 male young adults (college students) 
aged 18-25 yr. whose mean age was 20.63 yr.; 
and 13 male adults whose mean age was 
70.5 yr., range 62-84 yr. The old adults were 
noninstitutionalized and were either employed 
full time or part time, or retired. All Ss were 
volunteers and some of the elderly group 
were compensated for their services. 

Apparatus and method of recording.—The 
equipment for recording the eyelid closure 
and for presenting the CS and UCS was 
similar to that used by Spence (1953), and 
will not be described here. The S sat in a 
dental chair in a darkened room adjoining 
that in which the stimulus controls and re- 
cording apparatus were located. He was 
instructed to blink on receiving a ready 
signal, and then to fixate on a 6-cm. circular 
milk glass disc placed at a distance at 125 cm. 

The CS was an increase in the brightness 
of this disc from 0.05 to 1.5 apparent ft.- 
candles. The duration of the CS on each 
trial was 1 sec. The UCS was a puff of air 
with an intensity of 2.0 lb./sq. in. delivered 
to the right eye, with a duration of 500 msec. 

' This investigation was supported in part 
by a research grant, M-1365, from the Na- 
tional Institute of Mental Health, U. S. 
Public Health Service. 
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The CS-UCS interval was 500 msec. All Ss 
were given 80 conditioning trials, on each of 
which the light and air puff were paired. 
Intertrial intervals of 10, 15, and 20 sec. 
were used according to a prearranged sched- 
ule. Following the last conditioning trial, 
20 extinction trials were given, in which both 
light and air puff were paired, but in which 
the CS-UCS interval was lengthened to 1500 
msec. Only those CR’s whose latency was 
300 msec. or more were included in the data. 
No Ss were excluded as voluntary responders, 
i.e., Ss who gave at least half of their CR’s 
with a latency of less than 300 msec. 


RESULTS 


Curves of acquisition of the CR 
for the three age groups are shown 
in Fig. 1 in terms of the percentage 
of CR’s in each block of 10 trials. 
Through the 60th conditioning trial, 
the curve for the children is consist- 
ently above that of the young adults, 
and it is above that of the elderly 
group throughout the entire 80 trials. 
The conditioning performance of the 
young and old adults is_ similar 
through the first 20 trials after which 








#04 OMLORENR—----* 
YOUNG aU. TS ——~ 
7" OLO au T$-——— 
A, 7 
w+ , a” 
ar AY 
4 an 
7 yl tin 
3 404 P \ 
rs ‘ 
307 i 4 \ 
~ i ~ ° 
OF 
,-. aon se + oS ee 
CONDIT ExTwction 
BLOCKS OF TEN TRS 
Fic. 1. Conditioning and extinction as a 


function of age. 


386 





CONDITIONED EYELID RESPONSE 


the young group improves markedly 
and the elderly group only slightly. 
The conditioning curve of the children 
is negatively accelerated, that of the 
young adult group is positively ac- 
celerated, while the curve of the old 
adults is flat. At the start of condi- 
tioning, the performance level of the 
children is relatively high and/or 
the rate of conditioning is high. On 
the other hand, the performance level 
of the other two age groups at the 
beginning of conditioning is relatively 
low and/or the rate of conditioning 
is slow. 

Statistical evaluation of these re- 
sults was made by the Mann-Whitney 
test. The difference between the 
number of CR’s made by the children 
and the old adults was significant 
beyond the .002 level for a two-tailed 
test (U = 23) and the difference be- 
tween the number of CR’s made by 
the young adults and the old adults 
was also significant beyond the same 
level of confidence (U = 25). The 
children and young adults did not 
differ significantly in the number of 
CR’s (U = 89). 

Curves of latency of the CR’s in 
blocks of 20 trials for the three age 
groups appear in Fig. 2. These data, 
which are measures of latency of CR 
to the onset of the CS on the 80 
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Fic. 2. Latency of CR as a function of age 
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acquisition trials, show that during 
the first 60 trials, the CR’s of the 
children are slower than those of 
the other two groups. During the 
last 20 conditioning trials, however, 
the three age groups do not appreci- 
ably differ on this measure. The 
sharp decrease in latency shown by 
the elderly group in the third block 
of twenty trials was caused by two 
new responders. 

The use of the Mann-Whitney 
test showed that the children made 
significantly more CR’s during extinc- 
tion than the elderly adults (U = 27; 
P < .002) as did the young adults 
(U = 49, P = .05). The two younger 
groups did not differ significantly 
in this respect. However, after anal- 
ysis of covariance, age differences in 
extinction were not significant. 


DISCUSSION 


The main and striking finding of this 
study is the relative inability of the 
elderly .Ss to acquire the conditioned 
eyeblink response. Of the 13 Ss in this 
group, four Ss gave no CR’s during the 
80 conditioning trials, seven gave from 
one to eight CR’s, while 37 and 75 CR's 
were given by the other two Ss. There 
were no instances of failure to condition 
in the two younger groups. This result 
confirms the report of Gakkel and Zinina 
(1953) although in their study data on 
the conditioning performance of younger 
Ss were not included. To account for 
the failure of older Ss to acquire CR’s, 
Gakkel and Zinina quote Pavlov as 
coming to the conclusion that “liveliness 
of the nerve processes suffer(s) from the 
development of senile changes.”” They 
also invoke the construct of “‘inertia’’ of 
the stimulating process at senile age. 

However, several considerations indi- 
cate that this hypothesis may not be 
entirely satisfactory. The latency of the 
CR’s of the elderly group are not longer 
than those of the other groups. It is 
admitted that these are tenuous data 
since they involve only a very small 
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number of responses. However, Weiss 
(1956) has shown that the latency of the 
retina as reflected in the retinogram does 
not change with age. His Ss were 15 
males aged 66 to 76 yr. and 10 between 
the ages of 18 to 37 yr. Similarly, 
Kumnick has reported that “the effi- 
ciency of the pupillary mechanism in re- 
lation to similar original diameter in the 
response to light stimuli did not decrease 
with increasing age’’ (Kumnick, 1956, 
p. 160). Thus, activation time of at 
least these aspects of the visual organ 
seems to be unchanged with age. It 
would be premature to conclude that the 
latencies of all simple involuntary re- 
sponses or reflexes do not change with 
age. Magladery, Teasdall, and Norris 
(1958) have shown that the latencies 
of plantar flexor responses and superficial 
abdominal reflexes are markedly slower 
in elderly Ss and children than in young 
adults. 

The authors suggest the consideration 
of an adaptation hypothesis to account 
for the relative unconditionability of 
elderly Ss. Both MacDonald (1946) 
and Taylor (1956) have reported that 
Ss who were adapted to the UCS (air 
puff) show a significantly lower level of 
conditioned eyelid responses than Ss who 
were not given preadaptation trials. It 
is proposed that in the course of many 
years of living, the eyelid response as well 
as probably other responses have been 
“adapted out” and thus less susceptible 
to subsequent conditioning. Research 
is under way to study several conse- 
quences of this hypothesis: that both 
frequency and amplitude of blink are 
reduced as a function of age. The 
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procedures employed in the present study 
did not provide data on these variables. 


SUMMARY 


This study investigated eyelid conditioning 
and extinction of children, young adults, and 
old adults under the same controlled condi- 
tions. All Ss were given 80 conditioning 
trials and 20 extinction trials. The children 
and young adults gave significantly more 
conditioned responses than the old adults. 
Elderly Ss were relatively unable to acquire 
the conditioned eyeblink response. Latency 
of response and extinction were not related 
to age. 
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OF 


PHYSIOLOGICAL CORRELATES 
HUNGER AND 


PAIN! 
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This study is an attempt to identify 
several specific response patterns me- 
diated by the autonomic nervous sys- 
tem (ANS) in the same group of Ss. 
The test conditions were: (a) graded 
periods of food deprivation, and (db) 
graded durations of foot immersion 
in iced water. The iced water condi- 
tion was chosen because of its well- 
known pain-producing effect. 

It has been shown that regions exist 
in the hypothalamus which are related 
to eating behavior. First to be 


described was the so-called satiety 
center (Brobeck, Tepperman, & Long, 
1942) in the medial hypothalamus; 
and, subsequently a region less clearly 
demarcated in the lateral hypothal- 


amus has been characterized as the 
hunger center (Anand, Dua, & Shoen- 
berg, 1955). Since the hypothalamus 
has also been identified as an integrat- 
ing center for responses mediated 
through the ANS, it would seem 
reasonable to expect different ANS 
patterns related to varying periods 
of food deprivation. 

Several studies have clearly shown 
that painful stimuli have autonomic 
effects (e.g., Cannon, 1929; Wolff & 
Wolf, 1951); and one study (Wolf & 
Hardy, 1941) attempted to separate 
the cooling effect from the pain- 


! This research was supported, in part, by 
USPHS Grant M-788. This paper is taken 
from a thesis submitted to the Department of 
Psychology, University of California at Los 
Angeles, in partial fulfillment of the require- 
ments for the Ph.D. degree. The author 
wishes to acknowledge his appreciation to 
M. A. Wenger for his assistance and guidance. 

2 Now at Cardiovascular Research Insti- 
tute, University of California School of 
Medicine, San Francisco 22, California. 


producing effect of the cold-pressor 
test. 


METHOD 


Independent variables.—Four levels of 
hunger and of pain were investigated. De- 
grees of hunger were produced by depriving 
each S of food for 2, 10, 14, and 24 hr., the 
first being regarded as a control period. Ten 
hours of deprivation was chosen because a 
shorter period would have interfered with the 
night's sleep of some Ss (all experiments were 
conducted during the morning), and a longer 
period would have been too similar to the 
14-hr. deprivation. The latter was chosen 
because of its convenient relationship to the 
normal dinner hour. 

Pain was produced by means of the cold- 
pressor test. This test requires that S im- 
merse an extremity (in this case the right foot, 
approximately to the level of the internal 
malleolus) into a container of iced water (in 
this study the temperature was maintained 
between 3° and 4°C.). Varying levels of pain 
supposedly were produced by varying the 
immersion time; zero (the control period), 
2, 3, and 4 min. were used. 

In order to relate the intensity of the 
stimuli to subjective impressions, each S was 
required to rate each stimulus in two ways. 
First, he rated it after each session by desig- 
nating the stimulus intensity as zero, mild, 
moderate, or severe. Next, after the second, 
third and fourth sessions of each stimulus he 
was asked to compare the stimuli among 
themselves and to order them in increasing 
magnitude. 

Dependent variables.—-The apparatus used 
in this study has been fully described by 
Wenger, Engel, & Clemens (1957), and 
Wenger, Henderson, & Dinning (1957). 
Table 1 summarizes the functions recorded, 
units of measurement, sampling frequency, 
and method of assay. 

Experimental design.—I\n order to counter- 
balance the stimulation periods, latin square 
designs were used. There were four squares 
for the hunger conditions and four separate 
squares for pain. Each of the four squares 
within each set was randomly selected; and 
S was randomly assigned to the rows of the 
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TABLE 1 





Function 


Recording 
Frequency 





Method of Assay 





Hunger 





Once per min. 


ms Continuously 
(micromhos) 


Finger, face, axillary | °C 
and room tempera- 
tures 


Heart rate cycles/min. Continuously 


Breathing rate cycles/min. Continuously 
microliters/ 


stroke /min. 


Finger pulse volume Continuously 


Presence or ab- | Continuously 
sence; sec/cycle 


of Type I waves 


Stomach motility 


Rate of salivary flow | microliters/min. Continuously 








Finger twice and 
face and axillary 
temperatures once 
every 10 sec. and 
room temperature 
at beginning of re- 
cording session 


-| Average of last 5 read- 
ings 


Mean rate during last 
5 min. of recording 
session 


Same as heart rate 


Mean of 5 modal am- 
plitudes during last 5 
min. of recording ses- 
sion 


Mean period of waves 
(when present) 
throughout entire re- 
cording session 


Mean rate during last 
5 min. of recording 








Average of last 2 read- 
ings 


Mean level attained based on samples taken at 
the same times as the blood pressure samples 


Mean level attained based on samples taken at 
the same times as the blood pressure samples 


Mean rate during last 
min. of recording 
n 


Same as heart rate 


Mean of 2 modal am- 
plitudes during last 2 
min. of recording ses- 
sion 


Mean period of waves 
(when present) before 
and during stimula- 
lation 


Mean rate during last 
2 min. of recording 





squares. In order to reduce the probability 
of a significant “order” effect, sophisticated 
Ss were used. Since Wenger (1948) has 
shown certain autonomic variables and time 
of day are correlated, this effect was con- 
founded with the rows (Ss) and the experi- 
mental times restricted to three morning hours 
(9:00, 10:00, or 11:00 a.m.). Each S was 
assigned a given sequence of pain and of 
hunger and the two sets of situations were 
interspersed at random. If B, A, C and D 
represent the sequence of cold-pressor tests, 
and 4, 3, 1 and 2 the sequence of deprivation 
sessions for a given S, his run might be: 
B-A-4-C-3-D-1-2. In order to control for any 
cumulative effects, at least 72 hr. intervened 
between any two recording sessions. 

Each experimental session covered ap- 
proximately 45 min. Twenty min. were re- 
quired to connect the leads to S and make the 
necessary equipment adjustments. There 
followed 10 min. of recording during rest, and 
5 min. of critical recording. (In the case of 
pain, only the last 2 min. were analyzed.) 
The 10-min. period was designed to allow S 
to adapt to the experimental environment and 


the data it afforded were not analyzed except 


in the case of stomach motility (Table 1). 

Subjects.—The Ss were 16 male psychology 
graduate students ranging in age from 23 to 
40; the mean age was 28.3 yr. 


RESULTS 


Hunger.—The rating scales were 
used to determine the relationship be- 
tween periods of deprivation and sub- 
jective impressions, and to determine 
whether there existed a hierarchical 
order of subjective sensation cor- 
responding to the deprivation periods. 
The concept of hunger was never 
specified for S. When he asked for a 
definition, he was told to use whatever 
criteria he considered relevant in light 
of his own experiences. 

The coefficient of concordance was 
chosen to give an index of the degree 
of consistency among Ss’ rankings of 
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the deprivation periods. In order to 
reduce the two sets of ranks to a 
single matrix, they were added, re- 
ranked, and the ties broken by coin 
tosses. The mean ranks for the 2-, 
10-, 14-, and 24-hr. deprivation condi- 
tions were: 1.1, 2.2, 3.0, and 3.7, 
respectively. The coefficient of con- 
cordance, .631, was significant beyond 
the .01 level; and since the difference 
between any consecutive pair of 
deprivation period means was ap- 
proximately the same, the conditions 
are probably graded in_ intensity. 
Taken together, these results are the 
bases for the conclusion that the 
increases in deprivation period used 
in this study produced increments in 
hunger, when the latter is defined 
as a sensation. 

The effects of food deprivation on 
each of the physiological functions 
tested are reproduced in Table 2. 
Table 3 shows the results of the analy- 
ses of variance for those physiological 
variables showing significant changes. 


TABLE 2 


MEAN LEVELS OF MEASURED ANS 
VARIABLES UNDER Four Foop 
DEPRIVATION CONDITIONS 


Deprivation Period 





Systolic blood pressure 421. .0 | 118. 0| 1 
Diastolic blood pressure | 79.5| 80.1 | 


Palmar conductance 
(log X 104) 

Respiration rate 

Face temperature 

Fi pulse volume 

Axi temperature 

Finger temperature 


* Cf. Table 1 for units of measurement. 


An analysis of room temperature 
fluctuations was included because 
previous studies have shown that they 
can influence some autonomic vari- 
ables (Wenger, 1948). The mean 
room temperatures during the depri- 
vation sessions ranged from 24.1° C. 
to 24.4° C.; the F was insignificant. 


TABLE 3 


ANALYSES OF 


VARIANCE OF Data CoLLecteD UNpER Four ConpITIONS 


oF Foop DEPRIVATION* 


Blood Pressure 


Source Systolic 


Food Deprivation | 
Linear 
Quadratic 
Cubic 


Order 


43.61 | 2.25 
3.14 

124.04 | 6.39" 
3.63 


1.75 





| 
| 
| 9.01%" 


Ss 192.53 


Error 16.68 


Total 


* The onaivans « of variance for all functions appeared in Vol. 4, No. 1, 
ined by aaiting to: Albert F. Ax, Scribe; Psychophysiology Newslir.; 


Dittoed copies of the raw data may be o' 
Lafayette Clinic; 951 E. Lafayette; Detroit 7, Michigai 


Pulse 


| 
1.89 42.8) 1.81 


135.21 | 8.11** | 220.3 | 9.33%" 


Axillary 


- H | 
Breathing Rate | Temp. 


Salivary Flow» 


273 | 
612 | 

‘051 

153 
6.10) 4.04* 037 
| 96.98 | 64.22%" | 231 


23.6 1.51 125 


1958 of the Psychophysiology Newslir. 


> Gap between the 24-hr. fast and the 10-hr. fast Tepiicant beyond the .01 level; gap between the 24- and 2-hr. 
fasts significant beyond the .0S level; gap between the 10- and 14-hr. fast approaches significance at the .0S level 


(gap = 83; Rp = 843). 
*P = 05. 
=P = 01. 

oP = 001. 
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The F's for pulse pressure, respiration 
rate, and salivary output are statis- 
tically significant (.05 level or greater) ; 
however, only the means for the 
last showed significant gaps when 
Duncan's test (1955) was applied. 
Twenty-four hours of deprivation 
produced greater salivary flow than 
either 10 or 2 hr., and the 14-hr. 
deprivation condition produced more 
salivary flow than did the 10-hr. 
deprivation. Some of the original 
data were markedly skewed and re- 
quired correction before analyses of 
variance tests could be applied. Sal- 
ivary outputs and finger pulse volumes 
were transformed to square roots, 
skin resistances to log conductances, 
and finger temperatures to log (37°- 
finger temperature). In the last case, 
the data had to be translated because 
they were negatively skewed. All 
transformations were determined on 
the basis of visual inspection of the 
frequency plots. 

In all cases in which the sums of 
squares for deprivation were at least 
four times as great as the mean 
square for error (for 1 and 40 df, 
F 95 = 4.08), the deprivation effects 
were tested to determine whether 
they had statistically significant lin- 


TABLE 4 


MEAN LEVELS OF MEASURED ANS VARIABLES 
DurinG Four Exposures 
To IceED WATER 


Exposure Time (Min.) 
Variables 


Systolic blood pressure 
Diastolic blood pressure 
Pulse pressure 

Heart rate 

Rate of salivary flow 
Log conductance (x 10*) 
Respiration rate 

Face temperature 
Finger pulse volume 
Axillary temperature 
Finger temperature 














* Cf. Table 1 for units of measurement. 


‘have linear components. 
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ear, quadratic, and cubic components 
(Table 3). In two instances (systolic 
blood pressure and axillary tempera- 
ture) the trend components were 
significant when the deprivation ef- 
fects were not. While data such as 
these must be regarded with some 
skepticism, they cannot be ignored. 
Both systolic blood pressure and 
pulse pressure have significant quad- 
ratic trends, whereas the other three 
The saliva-. 
tion data may have a quadratic com- 
ponent, however, since the gap between 
the 10- and 14-hr. deprivation periods, 
.83, falls just short of significance at 
the .05 level (.843), and the gap 
between the 2- and 10-hr. periods, .74, 
approaches significance (R.o5 = .801). 
The F for quadraticity, 3.12, is well 
short of that required for significance 
(Fo5 = 4.08). 

The data for stomach motility were 
not analyzed statistically, since their 
form precludes the use of most 
standard statistical procedures. In 
the 2-, 10-, 14-, and 24-hr. deprivation 
conditions 9, 7, 10, and 11 Ss showed 
Type 1 waves,’ respectively, and the 
mean periods of these waves were 
19.9, 19.4, 19.5, and 19.0 sec., respec- 
tively. 

It can be seen that the incidence of 
activity increased with increasing 
food deprivation except for the 10-hr. 
fast ; and that the mean period of con- 
traction decreases as deprivation time 


*The most commonly accepted termin- 
ology for designating stomach contractions 
according to type is that given by Code, 
Hightower, and Morlock (1952). Type | 
waves are those with periods ranging from 
about 5 to 20 sec.; Type II waves range in 
period from 12 to 75 sec. ; and Type III waves 
cover about 3 to 5 min. Type II waves also 
have approximately 3-fold greater amplitudes 
than do Type I waves. Probably some of the 
waves called Type I are Type II, since only 
wave period is being measured; however, the 
greatest number of contractions are undoubt- 
edly Type I. 
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increases, although the 10- and 14-hr. 
periods are probably indistinguishable. 

Pain.—The subjective data for pain 
were combined to form a single 
matrix. The Ss were told to use 
whatever criteria they considered 
relevant in the light of their own ex- 
periences to define pain. The mean 
ranks for the 0-, 2-, 3-, and 4-min. 
immersions were 1.0, 2.7, 3.0 and, 3.2 
respectively. The coefficient of con- 
cordance, .627, was significant beyond 
the .01 level. However, the intervals 
among the ranks are highly unequal, 
and in fact, the 2-, 3-, and 4-min. 
conditions were about equally rated. 
Thus there probably is little basis for 
concluding that the different exposure 
periods produced graded pain. There 
seems little doubt, however, that 2-, 
3-, or 4-min. exposure to iced water 
does produce the sensation of pain, 
since all Ss agreed that these periods 
were painful. 
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It should be noted that there was 
a significant room temperature bias 
of the “order effect.” Since the 
“cold-pressor effect’’ is insignificant 
and orthogonal to the ‘order effect,” 
no corrections were applied. The 
mean room temperatures during the 
four cold-pressor sessions ranged from 
24.1° C. to 24.4° C. Another inter- 
esting finding was that the data which 
required correction for skewness in 
the hunger analyses required the same 
corrections for pain. Thus rate of 
salivary flow and finger pulse volume 
changes were transformed to square 
roots, skin resistances to log con- 
ductances, and finger temperatures 
to log (37°-finger temperature). 

Table 4 shows the effect of pain 
stimulation on the various autonomic 
functions and Table 5 the results of 
the F tests for those functions showing 
significant change. 
tolic blood 


Systolic and dias- 


pressures, finger pulse 


TABLE 5 


ANALYSES OF VARIANCE OF THOSE RESPONSES SHOWING SIGNIFICANT CHANGE 
DurinG Four Pertops or Exposure To IcepD WATER* 


Blood Pressure 


Source Systolic 


MS F 


488.57 | 

1097.05 | 14.98*** | 

368.16 | 5.03* 
49 . 


Cold Pressor 3 | 

Linear 
Quadratic 

Cubic 


Order 41.67 


3.26** 


| 
| 
Ss | 238.98 
| 


Error 42 73.24 


Tota 63 


* See Footnote a, Table 3. 


MS F 
6.67*** | 533.35 | 11.22*** | 177.31 | 3.93* 
724.50 
830.16 
45.38 
13.10 
221.66 4.66*** | 179.00 | 3.97*** 


47.54 | 


Finger Pulse 


Heart Rate Volumes 


Diastolic» 


MS F MS 
363 
220 

| .766 
105 


15.24*** | 


62.48 | 1.38 
17.46°** | 


73.53 | 1.63 
395.60 | 8.77** 


31.58 033 


A61| 7.43°** 


45.09 .062 


> Gap between the control and 3-min. exposure significant beyond the .05 level (gap = 12.5; Rp = 12.3 at the 


05 level. 
¢ Error df = 38, total df = 59. 
*P = OS. 
—-P = O11. 
7 P = OO. 
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volume changes and heart rate all had 
significant F’s, and there was a sig- 
nificant gap (Duncan, 1955) between 
the diastolic blood pressure for the 
control session and the 3-min. ex- 
posure. All those functions which 
yielded significant F's also had sig- 
nificant trend components. Systolic 
and diastolic blood pressures had both 
linear and quadratic trends; finger 
pulse volume had a quadratic trend 
only; and heart rate showed a cubic 
component. There were 27 instances 
during which stimulation was preceded 
by some evidence of gastric motility. 
During 15 of these exposures, motility 
was completely blocked; in one S 
motility was induced by the stimulus 
on two occasions. The mean period 
for the 0-, 2-, 3-, and 4-min. conditions 
was 19.8, 19.4, 17.0, and 19.0, respec- 
tively; however, it should be noted 
that the stimulus means are less 
reliable than the prestimulus data; 
the N for the control condition was 
12, whereas the 2-, 3-, and 4-min. 
means are based on N’s of 3, 5, and 6, 


respectively. 


DIscussION 

Hunger.—Because of the parallel trends 
between the pulse pressure and systolic 
blood pressure curves, it seems probable 
that the pulse pressure effect is due 
essentially to changes in systolic pressure. 
It should be noted, however, that the 
diastolic pressure changes cannot be 
wholly ignored for these must have co- 
varied with the systolic pressures thereby 
making pulse pressure changes signifi- 
cant. One of the most interesting find- 
ings is that many of the significant 
changes which occurred are attributable 
to the 10-hr. deprivation period: pulse 
pressure was least, systolic pressure 
lowest, and salivary flow slowest under 
this condition. In the case of the blood 
pressure changes, the 24-hr. deprivation 
effects were indistinguishable from those 
of the control period. The rate of 
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salivary flow, however, increased fairly 
regularly with increased deprivation, 
after the initial drop. Axillary tem- 
perature, respiration rate, and the period 
of Type 1 stomach contractions all 
showed fairly linear changes to the 
deprivation periods. The first mani- 
fested a gradual increase as a function 
of deprivation time, while the last two 
showed progressive decreases. 

The finding that axillary temperature 
increases with increased food deprivation 
has a bearing on Strominger and Bro- 
beck’s (1953) thermodynamic theory. 
It would seem that they would predict 
just the opposite effect. However, it 
should be remembered that while axillary 
temperature showed a statistically signifi- 
cant linear trend, the differences among 
the means for the four conditions were 
not significant. On the other hand, 
Booth and Strang’s (1936) data, which 
Strominger and Brobeck cite as support 
for their position, were based on tem- 
perature levels of the hands and feet. 
These probably more accurately reflected 
blood flow under the sensing element 
than body temperature per se. In the 
present study there were no differences 
between face and finger temperatures 
during the control tests and the various 
food deprivation conditions. A critical 
test of Strominger and Brobeck’s hy- 
pothesis would result from measurement 
of temperature changes in the hypo- 
thalamus itself. 

The breathing rate effects are con- 
sistent with Cannon's (1929) hypothesis 
that increased hunger results in height- 
ened activity of the cranial division of 
the ANS. Likewise, the 10-hr. drops in 
systolic blood pressure and the 14- and 
24-hr. increases in rate of salivary flow 
could be attributed to parasympathetic 
(PNS) hyperactivity; and the increase 
in incidence of Type 1 stomach waves is 
indicative of heightened vagal activity. 
But certain of the data do not support 
the thesis that hunger is univocally 
associated with heightened activity of the 
cranial ANS. The 10-hr. decrease in 
salivary flow, the reversal in trend of 
systolic blood pressure, the progressive 
increase in body temperature, and the 
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decreased period of gastric contractions 
could possibly be attributed to height- 
ened SNS activity. These data point to 
two conclusions: (a) that the cranial 
ANS is not the only factor operating to 
alter peripheral autonomic responses 
during hunger; (b) that the relationship 
between the intensity of hunger and level 
of activation of specific autonomic vari- 
ables is not linear. 

In order to account for the sympa- 
thetic-like effects in the presence of some 
indices of increased PNS activity, it 
seems necessary to infer some increase in 
level of activity of (a) portions of the 
SNS, (6) the adrenal medullae, or both. 
Which factors are actually operant must 
be the subject for future research. 

The subjective data point to the con- 
clusion that there was, in general, an 
increase in hunger with increased de- 
privation time. If increases in intensity 
of hunger produce concomitant increases 
or decreases in the level of activation in 
any branch of the autonomic nervous 
system, one should expect monotonic 
trends in all functions mediated by that 
branch. Instead, it was observed that 
the pressor responses had quadratic 
rather than linear trend components. It 
therefore follows that the pattern as well 
as the level of autonomic response differs 
for different periods of food deprivation. 

Pain.—The major effects found under 
cold-pressor stimulation were cardio- 
vascular changes. Both systolic and 
diastolic pressures increased about 
equally, thus leaving pulse pressure un- 
affected; and there were increases in 
heart rate and peripheral resistance. 
The fact that the stimuli blocked gastric 
contractions in over 50% of the cases and 
induced it twice are also pertinent results. 

The finding that the cold-pressor test 
will produce a significant pressor response 
is, of course, not new; nor is it surprising 
that this effect is mediated through 
vasoconstrictor and _ cardio-accelerator 
reflexes. More interesting are some of 
the statistically insignificant results: 
(a) that skin conductance level is not a 
function of painful stimulation; (b) that 
increased breathing rate is not a neces- 
sary consequence of painful stimulation; 
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and (c) that there is not a significant drop 
in finger temperature with such stimu- 
lation. 

There are several factors that could 
account for these apparent discrepancies. 
One is that the quality of stimulation 
determines the nature of the response. 
Thus there might be different patterns 
of response associated with different 
pain-producing stimuli. A second possi- 
bility is that the effects others have 
attributed to painful stimuli may be 
associated with startle at the onset of the 
stimulus rather than with the stimulus 
per se. If this is true, there may be a 
generalized response to the onset of 
stimulation, out of which develops more 
specific response patterns to specific 
classes of stimulation. It should be 
pointed out that the measure of skin 
conductance used here was the level near 
the end of the stimulus period (cf. 
Table 1) rather than the maximum 
change. However, in a number of in- 


stances there was no change in skin 
time during 


resistance at any the 
stimulation period. 

In view of the dissimilarities in trend 
among the various responses, it appears 
that different durations of cold-pressor 
stimulation resulted in different patterns 
of autonomic activation despite the fact 
that the subjects did not clearly dis- 
tinguish differences in pain among the 
three durations of exposure. One pos- 
sible explanation of the apparently dis- 
similar patterns is that some adaptation 
was taking place. Table 4 shows this 
effect occurred in the pressor responses 
and in finger pulse volume changes. 
Heart rates also tended to decrease 
during the longer exposure periods. 


SUMMARY 


This study reports the effects of 2, 10, 14, 
and 24 hr. of food deprivation (hunger) and 
of 0, 2, 3, and 4 min. of cold-pressor stimula- 
tion (pain) on 12 autonomically mediated 
responses of 16 male Ss. It was observed 
that increased food deprivation led to: (a) de- 
creases in pulse pressure which probably were 
primarily a result of decreases in systolic 
blood pressure ; (b) linear increases in axillary 
temperature and salivary output; (c) a linear 
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decrease in respiration rate; (d) increased 
incidence of Type 1 stomach contractions; 
and (e) decreased period of Type 1 con- 
tractions. 

Cold-pressor stimulation resulted in: (a) in- 
creases in systolic and diastolic blood pres- 
sures; (b) increases in heart rate; (c) decreases 
in finger pulse volume; (d) some blocking of 
Type 1 stomach contractions (56% of the 
cases); or (e) decreased period of Type 1 
contractions. 

It was concluded that: (a) the thermo- 
dynamic theory of hunger was not supported ; 
(b) the relationship between intensity of 
hunger and level of autonomic response is not 
linear; (c) the pattern of response to immer- 
sion in iced water varied with the duration of 
immersion even though Ss did not report any 
differences in pain among the immersions. 
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In the past (McGeoch & Irion, 
1952) retroactive interference (R]) 
had been considered the major con- 
tributor to forgetting. A recent re- 
view of the literature, however, had 
led to the implication that one prin- 
cipal influence heretofore ignored is 
actually proactive interference (Un- 
derwood, 1957). The function ob- 
tained by Underwood through his 
collation of the data from classical 
studies on forgetting indicates that 
approximately 85% of the effect, 
when corrected for artifactual recall 
limit, can be directly attributable to 
the number of previous lists (PI) 
given. Although not specifically con- 


sidered by Underwood, the relative 


importance of such a function for the 
laboratory study of forgetting, how- 
ever, rests in its relation to a com- 
parable RI curve and a combined RI 
and PI curve. Since the above noted 
results were obtained from studies 
having no formal RI material, the 
question remains of determining with- 
in the formal experimental setting, 
the relative contributions of both 
sources of interference. Only one 
previous classical study (Lester, 1932) 
presents any indication of a possible 
summative effect of RI and PI. 
However, the result was inadvertent 
to the main purpose of the study, and 
no systematic investigation of com- 
binative RI and PI effects was at- 
tempted. It is this problem of con- 
current effects which the 
study investigated. 


present 


1Now at Human Factors Division, Re- 
search Directorate, Kirtland Air Force Base, 
New Mexico. 


METHOD 


Subjects.—A sample of 90 Ss was used in a 
3 X 3 factorial design. The Ss were mostly 
undergraduate summer school students taking 
introductory psychology. The remainder of 
the sample included some students of an 
advanced psychology course as well as a few 
departmental secretaries. The students in 
introductory psychology were given credit 
for a laboratory report, but the other Ss 
volunteered without reward. 

Apparatus.—A 35-mm. Revere automatic 
slide projector was used to present nonsense 
syllables at a rate of 4 sec. per syllable. 
There was a 2-sec. intersyllable interval. The 
nonsense syllables were drawn from the 53%, 
47%, and 40% Glaze lists (Hilgard, 1951), 
and were divided into nine lists each of 12 
syllables with equivalent (47% median) 
associative value. The lists were assigned 
at random to the cells for use as first lists. 
Because of the possible effects of rehearsal 
among classmates, nine lists (rather than the 
five required for counterbalancing) were used 
to discourage communication among Ss. 
Each list was then used a total of three times, 
once as OL, PL, IL with each cell having a 
different list for OL and a different combina- 
tion of lists for PL and IL. Hilgard’s (1951, 
p. 540) suggested rules for use of nonsense 
material were followed. 

Procedure.—The basic paradigm for this 
study was X-A-B-A. That is, in Session 1 
Ss were given either zero (Po), one (P;), or 
two (P2) practice lists (X). In Session 2 they 
were given a test list (A) and in Session 3 
they were given zero (Ro), one (R,), or two 
(R2) RI lists (B) immediately prior to the 
recall of A. Twenty-four hours separated 
each session. During the planning of the 
experimental design, the question of appro- 
priate intertreatment time intervals was 
considered in detail. The IL Recall interval 
was based upon available literature (Archer 
& Underwood, 1951; Briggs, 1954) which had 
shown no change in RI effects over 48 hr. 
However, the data on the effects of varying 
PI Recall temporal interval had not yielded 
clearcut results prior to Underwood's recent 


studies (1948; 1949), When considered 
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together, these two experiments reveal that 
the time interval chosen in the present study, 
48-hr., maximizes the possible influence of PI 
on recall. Moreover, other evidence (Melton 
& Von Lackum, 1941; Ray, 1945) indicates 
the maximal effect of PI on transfer to be 
between 24-48 hr., which tends to justify the 
24-hr. PI-OL interval used. 

The experiment was run as a two-factor 
variance design with 10 Ss assigned at random 
to each cell. By means of the method of 
anticipation each list was learned to one error- 
less trial. The Ss of each cell were run in a 
group, thereby yielding nine groups of 10 
each. Broken appointments necessitated fill- 
ing the cells with the following additional Ss, 
also run in groups where possible: PoRo, 1; 
PoRa, 1; Pi Ro, 5; P,R,, 4; P2Ro, e. P.R,, Zz 
Anticipations were written on a spiral pad 
and after each trial, S turned to a new page. 
As each S reached criterion, he notified E 
and then proceeded to an irrelevant task 
inserted to obscure the actual purpose of the 
experiment. The E continually checked the 
writing of the anticipations during the experi- 
ment, and the group technique of collecting 
the data proved extremely effective. Only 
one 5's data were not counted because she 
wrote down the syllables after they were 
exposed. Both recall and learning were 
measured. Recall score was the number of 


correct anticipations on the first relearning 


trial. Relearning score was number of trials 
to one errorless repetition. Responses were 
not forced during recall and relearning. 
Although the purpose of the experiment 
was to study forgetting as function of inter- 
ference, Ss were told that the experiment was 
an attempt “to study the effects of verbal 
learning on motor problem solving.”” A jig- 
saw puzzle was utilized to hide the actual 
purpose. It was hoped that with this subter- 
fuge, motivation to learn, social pressure, and 
rehearsal between sessions would all be 
properly controlled. Prior to the PI session 
(X), P; and P, Ss were instructed that before 
starting “the main portion of the experiment 
tomorrow, I am going to give you practice 
in the tasks in order to get you acquainted 
with the experimental materials.” To lessen 
possible tension, stemming from the group 
setting, assurance was given that many Ss 
were being used simultaneously to facilitate 
collecting data and that Ss should pay no 
attention to their neighbors’ work. Code 
numbers were used to identify the work of 
each S. As Ss reached criterion on the list, 
they were given difficult jigsaw puzzles to 
solve. Each S kept his own record of time 
and number of pieces correctly joined together. 
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As soon as the last person in the group 
reached criterion on the list, puzzle solving 
was stopped, and dependent on the experi- 
mental condition, the group learned a second 
list or were dismissed for the day. At this 
point those who had not had time to work on 
the puzzles were reassured that they would 
have “an opportunity to do so before the 
experiment is completed.”’ At the end of 
each session, Ss were asked to forget how their 
performance seemed that day, to relax be- 
tween sessions, and to be ready to start fresh 
on the following day. 

That the instructions and the puzzle were 
effective was clearly shown at the end of the 
experiment. First, when Ss were told after 
learning B that the next list they were to 
learn was the same list which they had been 
given the previous day (OL) all expressed 
surprise and bemoaned such an undertaking. 
Secondly, in a postexperiment interview, no 
student guessed the purpose of the experi- 
ment and nearly all had accepted the stated 
purpose without question. Finally, no stu- 
dent reported rehearsal of lists between 
sessions. 


RESULTS 


Although specific first lists differed 
among all groups, an analysis of vari- 
ance of learning first lists showed no 
significant differences existed initially 
between the nine groups (F = 1.37, 
8 and 81 df). Furthermore, a similar 
analysis on the learning of the test 
list (OL), also different for each cell, 
revealed no differential practice effects 
based upon different amounts of PI 
learning (F = 1.42, 8 and 81 df). 

Table 1 presents the mean per- 
centages recalled and trials to relearn 
for each treatment group. The Bart- 
lett test of homogeneity of variance 
computed on both the recall and re- 
learning data was not significant in 
either case. Both main effects were 
significant for recall beyond the .001 
level (see Table 2). There was no 
significant interaction. The relearn- 
ing data essentially substantiate these 
findings although a significant inter- 
action appears in Table 2. The only 
PI simple effect which is significant 
occurs at the R. level (F = 5,16, 
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P < 01). The RI effects are signifi- 
cant at the P,; and P; levels (F = 3.7, 
F = 7.59, P < .01). Critical differ- 
ences between cell means indicate 
. three significantly different groupings 
in recall: the zero interference condi- 
tion, the pure RI and PI conditions, 
and the mixed interference conditions. 
In relearning the difference between 
no interference and the pure PI and 
RI conditions drops out. 

Further analysis, based upon the 
quality of interference, yielded effects 
not apparent in the critical-t compari- 
son. Considering as treatments the 
zero interference condition, pure PI, 
pure RI, and the combined PI and 
RI conditions, the data were next 
analyzed as a simple randomized 
design of four groups. The dimen- 
sion consisted of type of interference. 
The F ratios for both recall (F = 57.5, 
3 and 86 df) and relearning (F = 8.7) 
were significant (P < .001). Figure 
1 shows clearly four different levels 
of recall dependent upon the nature 
of the interpolated activity. None 
of the intralevel comparisons were 


TABLE 1 


MEANS FOR PERCENTAGE RECALL, FOR RE- 
LEARNING TRIALS, AND FREQUENCY OF 
INTRUSION DuRING RECALL or OL 
FROM PROACTIVE AND RETRO- 

ACTIVE SOURCES 


> - a : 
; — | pest Intrusions 


Cond. 


Mean| SD 


Note.—Critical difference for .05 level of confidence 
17.85% Recall, 1.22 trials Relearning. 


399 


TABLE 2 


ANALYSES OF VARIANCE ON RECALL AND 
RELEARNING SCORES 








significant, a finding which indicates 
that the variations from one list to 
two, etc., at any given interference 
level were random. The reason for 
this is probably that the 12-syllable 
nonsense list of 47% associative value 
learned by anticipation was not a 
sufficiently sensitive task. Interlevel 
comparisons were all significant, how- 
ever, beyond the .01 level. For the 
smallest difference (between pure 
PI and pure RI) ¢ = 4.6 and P < .01 
(df = 38). Supplementary informa- 
tion on the type of interference ob- 
tained is given in Table 1. The inter- 
list intrusions at recall clearly show a 
predominance of proactive words (an 
effect which can be expected with the 
relative temporal conditions utilized). 


DISCUSSION 


In terms of comparison with Under- 
wood's review (1957), it is most im- 
portant to note the 72% recall for the 
zero interference condition (Po Ro) in 
the present investigation. This cor- 
responds almost identically with the zero 
point of Underwood's PI curve and thus 
adds strength to the comparability of 
the two sets of data. In a larger sense, 
this agreement means that Underwood's 
general conclusion that experimental 
manipulation accounts for 75% of the 
usual obtained variance in forgetting 
is verified. (Note that this stability 
of occurrence appears despite variations 
in apparatus, tasks, etc.) In addition, 
as seen in Table 1 of the present study, 
the effect of two proactive lists, 48% 











ROBERT J. SEIDEL 





10of-- 
© ZERO I 
R oPL 
Cc gor @RI 
. 4 x PI+RI 
T * P2eR: 
. TP, Re 
¢ Sor 
4 
C ° ? 
‘ 40r 
L @ 
20r 
e 
x ao 
t 
O a 1 l L j 
19) | 2 3 a 


TREATMENTS (INTERFERENCE LISTS) 


Fic. 1. 


recall, was almost identical to the recall 
(approximately 47% for two warm-up 
lists) shown in Underwood's curve. 

It is most interesting to note that the 
present and the collated data yield simi- 
lar upper limits. However, as seen in 
Fig. 1 the slope of any given recall 
function is dependent on the quality 
of the interfering material. For ex- 
ample, given the temporal intervals in 
the present study, the curve of the com- 
bined interference data is much steeper 
than either PI or RI alone. Further, 
one previous retroaction study compar- 
able in part to the present experiment 
presents similar data. Lester’s (1932) 
experiment included a group given one 
retroactive list in addition to the two 
practice lists given everyone (PI); and 
for the RI Group recall was 3.3% as 
compared with 35.3% for the correspond- 
ing PI Group. The comparable cells 
in Table 1 show a similar drop from 48% 
recall to 16% with the inclusion of one 
RI list. Going from two to three lists 
in Underwood's curve, however, results 
in a slight drop from 47% to approxi- 
mately 42% (Underwood, 1957, p. 53). 


What the latter curve represents, of 


Recall and type of interference. 


course, is a forgetting function based 
solely on PI. With as many as 21 PI 
lists, the monotonic function reaches 
almost total forgetting as a limit. Cer- 
tainly increasing PI or RI may result 
eventually in complete experimental 
forgetting. The value of the present 
study, however, rests in showing, given 
the comparability to Underwood's col- 
lated data, that the combined PI-RI 
interference function arrives at this point 
much more rapidly. Furthermore, in 
view of the practical inevitability for the 
occurrence of at least some informal 
PI and RI in any laboratory experiment 
on forgetting, a paradigm containing 
both interference components seems to 
be the realistic one for the experimental 
study of forgetting. 

Incidental to the main purpose of the 
investigation discussed above, the pres- 
ent study also has produced data relevant 
to the process of forgetting. Although 
the data are not crucial to the issue, 
the intrusions do, as shown in Table 1, 
provide some interesting information 
concerning a one- or two-factor theory 
(interference and unlearning) of for- 
getting (Melton & Irwin, 1940). The 








CONCURRENT EFFECTS OF PI AND RI 


theoretical argument goes that, if compe- 
tition alone is responsible for forgetting, 
then the learning of interpolated lists 
should be interfered with to the same 
degree by prior learning (PI) as the 
recall and relearning of the original list 
is by interpolated learning (RI). In 
the most general case, the proponents 
of two-factor theory have stated that 
in order for a competing response or 
interference theory to be correct, RI 
should equal PI (Underwood, 1945). 
Note that Fig. 1 shows pure PI to yield 
greater recall than pure RI. It is true 
that such an effect has previously been 
reported; hence, the two-factor theory. 
However, in none of these classical 
studies on forgetting (McGeoch & Irion, 
1952; McGeoch & Underwood, 1943; 
Melton & Irwin, 1940; Melton & Von 
Lackum, 1941; Underwood, 1945) were 
PI and RI investigated concurrently. 
Typically one would measure PI or RI 
with one group of subjects or set of lists 
and then measure these effects with the 
other separately. The difference ob- 
tained between the curves was said to 
be due to unlearning. The present study 
at this point seems to support this view. 

Indication that interference and un- 
learning may not operate as customarily 
conceived, however, is found in Table 
1 by the predominance of proactive 
intrusions at recall.2 If unlearning in- 
fluences forgetting, then it follows that 
material which has more similar re- 
sponse learning subsequent to its being 
learned should be subject to more un- 
learning and thus to greater forgetting 
than material having less such learning 
following. This means that the pro- 
active lists in the present study had a 
greater probability of being influenced 
by unlearning than did the retroactive 
lists. It also follows that, if greater 
unlearning occurs for PI, then it should 


?An additional difficulty for any simple 
interpretation is that, at least in one case 
(Underwood, 1948), the difference between 
RI and PI was found to disappear as the 
OL-Recall interval was increased from 5 to 


48 hr. Although RI did not change, PI 
increased. In this study, PI lists occurred 
immediately prior toOL. , ’ 


401 


exert less overt interference (e.g., intru- 
sions) on recall than the RI. Yet the 
data in Table 1 show the intrusion 
frequency to be reversed, and thus are 
contrary to the unlearning hypothesis. 
From a strict interference viewpoint, 
however, one cani readily account for 
this outcome. To begin with, overt 
interference (intrusions) is greater when 
the competing response systems are less 
discriminable. Response systems in turn 
are differentiable in inverse relation to 
time between their learning and the 
recall of one system, e.g., the test list 
(Underwood, 1945). Note that strong 
interference or overt intrusions indicates 
poor response differentiation. Given the 
procedure in the present study, this 
interpretation would predict more overt 
PI (lists learned 48 hr. prior to recall) 
and greater implicit (weaker) RI (lists 
learned immediately prior to recall). 
Table 1 agrees with the prediction. 
Although the present analysis does 
indicate, then, general agreement with 
empirical findings on interference and 
supports a general interference theory 
of forgetting, the data do oppose a 
simple PI or RI formulation. Actually, 
in terms of a competing response inter- 
pretation which includes both PI and 
RI the results make good sense. When 
either proactive lists or retroactive lists 
are introduced alone, they act to compete 
unidirectionally with the responses of 
the test list. But when the interfering 
materials are combined, the test list 
is acted upon from two directions at 
recall, prior learning, and interpolated 
learning. Of course, the question of 
the reasons for the difference in related 
strengths of RI and PI still remains. 
To speculate a bit further, one might 
consider the learning of many lists simply 
serial learning on a larger scale. In 
any serial list the words between similar 
words are learned with greater difficulty 
than those benefiting from primacy or 
recency. Similarly, recalling the first 
or last list learned in a series may be 
easier than recalling lists learned in the 
middle of the series. In fact, this 
interpretation suggests the possibility of 
an interlist serial recall curve analogous 
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to the usual intralist serial learning bow. 
If the analogy holds, it would mean 
that other serial phenomena such as the 
von Restorff effect should also be found 
in interlist recall studies. 


SUMMARY 


This experiment was designed to investi- 
gate concurrent effects of PI and RI on for- 
getting. Ninety Ss, 10 per cell, were used 
in a two-factor design consisting of zero, 
one, or two amounts of PI and/or RI. Mem- 
orization of lists of nonsense syllables was 
used as the experimental tasks. Recall and 
relearning were measured. 

The data supported a general interference 
theory: (a) Increasing number of interfering 
lists learned prior to recall of the test list, 
and (b) quality of interfering lists, viz., pure 
PI, pure RI, or combined PI-RI, yielded 
increasing amounts of forgetting. 
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VARIABLES AFFECTNG CHILDREN’S PERFORMANCE 
IN A PROBABILITY LEARNING TASK! 


HAROLD W. STEVENSON anp MORTON W. WEIR 


University of Texas 


It was found in a recent study 
(Stevenson & Zigler, 1958) that 
children’s performance in a_prob- 
ability learning task differed for 
normal and institutionalized feeble- 
minded children and for normal 
children given pretraining on games 
involving high and low degrees of 
success. It was hypothesized that 
these differences were related to the 
children’s different expectancies for 
frequency of reinforcement. These 
expectancies were assumed to affect 
performance by determining the de- 
gree to which the child would accept 
a solution in. the probability learning 
task which yielded less than 100% 
reinforcement. It was predicted that 
Ss with low expectancies would show 
a greater frequency of choice of the 
reinforcing stimulus than would Ss 
with higher expectancies. This would 
result from the attempt by Ss with 
higher expectancies to seek, through 
variable behavior, a means by which 
they could obtain a frequency of 
reinforcement corresponding to the 
frequency which they expect. 

In the present study three experi- 
ments are reported investigating the 
effects of three variables on prob- 
ability learning in children: CA, 
incentive conditions, and shifts in 
percentage of reinforcement. It is 
assumed that older children have 
expectancies for a higher percentage 
of reinforcement than do younger 
children and that the performance 
of these groups will differ in the 

This study was supported by a grant 


from the National Science Foundation (Grant 
3280) to the first author. 


manner outlined above. In the ex- 
periment involving incentive condi- 
tions, the same general hypothesis is 
tested; however, it is assumed that 
the degree to which the child will 
accept a particular frequency of 
reinforcement is a function in part 
of the value of the reinforcement to 
S. In the third experiment, the 
effects of shifts in percentage of rein- 
forcement during the experimental 
trials are investigated. The hypothe- 
ses tested in each experiment are 
discussed in greater detail in later 
sections. 


EXPERIMENT | 


The purpose of this experiment was 
to investigate the behavior of young 
children of different chronological 
ages in a probability learning task. 
There is some basis for assuming 
that performance in such a_ task 
would differ as a function of CA. 
It has been suggested (Goodnow, 
1955) that one of the conditions 
influencing performance in a proba- 
bility learning task is the level of 
success § will accept in the task. 
It seems reasonable to assume that 
older Ss’ greater familiarity with 
soluble problems and their greater 
desire to master adult-controlled tasks 
would result in their being less likely 
than younger children to accept low 
levels of success. It is assumed that 
when less than 100% reinforcement 
is available older children would 
seek a solution which would provide 
a level of success more acceptable 
than that attainable by the consistent 
choice of the reinforcing stimulus. 
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The variable behavior resulting from 
their attempts would reduce the fre- 
quency of choice of the reinforcing stim- 
ulus below that found with younger 
Ss. It is predicted, therefore, that 
with levels of reinforcement below 
100% there is an inverse relationship 
between frequency of choice of the 
reinforcing stimulus and chronological 
age. It is further predicted that 
under conditions of 100% reinforce- 
ment the asymptotic frequency of 
correct response will be comparable 
at all CA levels, since 100% rein- 
forcement provides a level of success 
acceptable to all Ss. 


Method 


Subjects—The Ss were 120 children 
attending nursery and elementary schools in 
Austin, Texas.? The Ss were selected at 
random from among children of the appro- 
priate CA's enrolled in the schools. Children 
at four age levels were used: 3-0 to 3-11, 
5-0 to 5-11, 7-0 to 7-11, and 8-9 to 10-2. 
There were 30 Ss at each CA level. 

Apparatus.—The apparatus has _ been 
described previously (Stevenson & Zigler, 
1958). It consisted of a yellow panel con- 
taining a horizontal row of three knobs. 
Above the knobs was a signal light and below 
the knobs was a delivery hole for marbles. 
The marbles fell into an enclosed plastic 
container. 

Procedure.—The S was seated in front of 
the apparatus and was told that he was to 
play a game. The E demonstrated the 
apparatus, and said, ‘When the light comes 
on, you push one of the knobs. If you push 
the correct knob a marble comes out here, 
like this. Now every time the light comes 
on you push the knob that you think will get 
*The writers wish to thank Larry 
Schmucker for testing Ss in Exp. I, Mary 
Bess Whidden for testing Ss in Exp. II, 
and Martin Steigman for testing Ss in Exp. 
III, and the directors or principals of All 
Saints Nursery School, Austin High School 
Child Development Laboratory, University of 
Texas Nursery School, Tarrytown Methodist 
Nursery School, and St. Louis School, St. 
Austin School, St. Mary's School, and Trinity 
Lutheran School for their cooperation in 
making Ss available. 
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you the marble. Remember, just push one 
knob each time the light comes on.” The 
S was told that he was to get as many marbles 
as he could and when the game was over he 
could choose two toys from a selection of toys 
including balloons, plastic figures, etc., 
which E showed him. 

Three conditions which differed in the 
percentage of reinforcement of correct re- 
sponse were used. For each S, one of the 
three knobs (either L, M, or R) was desig- 
nated as the correct knob. The knob which 
was to be correct for a particular S was 
determined by a prearranged random se- 
quence. An equal number of Ss was tested 
with each knob. The correct knob yielded 
reinforcement ; choices of the other two knobs 
were never reinforced. Depending upon the 
condition, the correct knob yielded 100%, 
66%, or 33% reinforcement. In other words, 
a contingent procedure of reinforcement was 
employed whereby S was reinforced for a 
certain percentage of the times that he chose 
the correct knob. For example, in the 33% 
condition reinforcement was delivered on 33% 
of S’s choices of the correct stimulus rather 
than on 33% of the trials. Ten Ss at each 
CA level were assigned at random to each 
of the three conditions. In the 66% and 33% 
conditions the trials on which a choice of the 
correct knob was reinforced were determined 
by a random sequence of 30 trials. This 
sequence was repeated when S pressed the 
correct knob on more than 30 trials. The 
Ss were given a total of 80 trials each. 


Results and Discussion 


The average numbers of correct 
responses in blocks of 10 trials for 
the four age levels and the three 
percentages of reinforcement are pre- 
sented in Fig. 1. An examination 
of the curves for the 33% and 66% 
groups reveals that performance dif- 
fered as a function of CA. As pre- 
dicted, there is a tendency for older 
Ss to choose the reinforcing stimulus 
less frequently than younger Ss. An 


analysis of variance of the results of 
the total 80 trials indicates a signifi- 
cant difference associated with CA 
(F = 4.43, df = 3 and 72, P < .O1). 
A significant difference is found also for 
percentage of reinforcement (/= 7.37, 
df = 1 and 72, P < .01). 


The inter- 
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action between CA and percentage of 
reinforcement is not significant (F< 1). 

Performance in the groups receiving 
100% reinforcement was similar at 
each age level. It was predicted that 
asymptotic level of response would 
be comparable in all four groups. 
This was found to be true, for all 
but six Ss chose the correct stimulus 
on all of the last 20 trials. There 
were two Ss in each age group except 
CA 3 who did not consistently choose 
the correct knob during the last 20 
trials. 


There are several additional features 
of Ss’ performance which are of interest. 
The Ss initially had strong position 
preferences. On Trial 1, 15% of the Ss 
chose the left knob, 50% the middle 
knob, and 35% the right knob. As a 
control for possible knob preferences, 
each knob was correct for an equal 


number of Ss at each CA level. Knob 
preferences did not operate throughout 
all the trials, however, for of the total 
number of correct responses made on the 
80 trials, .33 were made when the left 
knob was correct, .35 when the middle 
knob was correct, and .32 when the right 
knob was correct. 

In order to determine whether the 
poor performance of older Ss was a result 
of their developing patterns of response, 
each individual’s performance was ex- 
amined. A high frequency of LMR and 
RML patterns of response appeared. 
The Ss at CA 3 had an average of 8.4 
repetitions of these patterns, Ss at CA 5 
an average of 14.8, Ss at CA 7 an average 
of 14.5, and Ss at CA 9 an average of 
14.0. No other patterns of response 
were readily apparent. The behavior of 
Ss at CA's 5, 7, and 9, therefore, was not 
random since approximately 50% of the 
responses wcre made in the form of one of 
the patterns analyzed. 


This may indi- 
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cate that the performance of these Ss is 
attributable in part to the fact that they 
were more likely than were the youngest 
Ss to form complex hypotheses resulting 
in the patterning of their responses. 

The high level of performance of the 
three-yr.-old Ss indicates that a task of 
this type is within the level of ability of 
young nursery school children. These 
Ss, as are most children attending private 
nursery schools, were highly selected. 
The older children, however, were also 
from private schools and were above 
average in their performance on group 
intelligence tests. 


EXPERIMENT II 


The purpose of this study was to 
investigate the effect of two incentive 
conditions upon performance of chil- 
dren in a probability learning task. 
The hypothesis to be tested was that, 
under conditions of less than 100% 
reinforcement, the frequency of choice 
of the reinforcing stimulus will vary 
inversely with the value of the in- 
centive. The hypothesis is derived 
in the following manner. In simple 
probability learning tasks such as the 
one used, children should learn rather 
quickly that the reinforcing stimulus 
does not pay off 100% of the time. 
If S accepts this frequency of rein- 
forcement, S should choose the rein- 
forcing stimulus with a high degree 
of consistency. If this frequency of 
reinforcement is not acceptable, S 
must attempt to find a solution to 
increase the frequency with which 
he is rewarded. This attempt leads 
to variable behavior, with the result 
that the frequency of choice of the 
reinforcing stimulus is lower than 
that occurring if such an attempt 
is not made. It is assumed that 
incentives affect performance by de- 
termining the degree to which S 
would attempt to find such solutions. 
It is further assumed that if incentives 
are of high value S is more likely to 
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seek a solution yielding consistent 
reinforcement than if incentives are 
of low value. In this manner in- 
centives of high value would produce 
more variable behavior than would 
incentives of lower value, with the 
consequence that there would be an 
inverse relationship between frequency 
of choice of the reinforcing stimulus 
and incentive value. 

Under conditions of 100% rein- 
forcement no differences between in- 
centive groups should appear, ex- 
cept perhaps in the rate at which 
an asymptotic level of response is 
approached. 


Method 


Subjects—The Ss consisted of 60 five-yr.- 
old children attending nursery schools in 
Austin, Texas. The Ss were selected at 
random from among children of the appro- 
priate CA enrolled in the schools. 

A pparatus.—The apparatus was the same 
as that described in Exp. I. 

Procedure.—High and low incentive con- 
ditions were used. In the high incentive 
condition, the choice of the reinforcing knob 
resulted in the delivery of a trinket. A wide 
assortment of small plastic trinkets was pro- 
vided. In the low incentive condition, the 
choice of the reinforcing knob resulted in the 
delivery of a marble. The S was told that 
at the end of the experiment the marbles 
could be exchanged for three trinkets of the 
types used in the high incentive condition. 

As in Exp. I, reinforcement was delivered 
on 33%, 66%, or 100% of the choices of the 
correct knob. The procedure was the same 
as that in Exp. I except for the reinforce- 
ments used. 


Results 


The average numbers of times that 


the correct knob was selected in 
each incentive condition under the 
three percentages of reinforcement 
are presented in Fig. 2. The curves 
for the 100% reinforcement groups 
are highly similar and do not differ 
significantly (¢ = .12). The data in 
Fig. 2 support the prediction that 
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the low incentive groups in the 66% 
and 33% conditions would choose 
the correct knob more frequently 
than the high incentive groups. An 
analysis of variance of these data 
results in an F of 6.71 (P = .05, 1 
and 36 df). The difference in fre- 
quency of choice of the correct 
knob in the groups receiving 66% 
and 33% reinforcement is not sig- 
nificant (F = 2.70, P > .05, df = 1 
and 36). The interaction between 
percentage of reinforcement and in- 
centive condition is not significant 
(F <1). 


The higher level of performance in the 
low incentive, compared to the high 
incentive groups in the 33% and 66% 
conditions is an inversion of the results 
typically found in learning situations. 
It has been noted earlier (Stevenson & 
Zigler, 1958) that even though Ss in the 
present type of problem may not choose 
the reinforcing stimulus consistently, 
they do know which stimulus has pro- 
vided reinforcement. This suggests that 
performance in this situation does not 
mirror the learning that has taken place; 
rather, it indicates the degree to which 
S will accept the percentage of rein- 
forcement provided by the reinforcing 
stimulus. 

The two incentive conditions ap- 
peared to operate in the anticipated 
manner; Ss in the high incentive groups 
seemed to be much more pleased to get 
the trinkets than were Ss in the low 
incentive groups to get the marbles. 

These results differ from those of 
Messick and Solley (1957), who report 
exploratory data indicating an increase 
in frequency of choice of the reinforced 
stimulus when candy was compared to 
verbal reinforcement as the incentive. 
It is difficult to compare the results of 
the two studies because of differences in 
procedure and because of the descriptive 
nature of their results. 

The results may be interpreted as 
providing support for the suggestion 
by Goodnow (1955) that one of the 
conditions affecting the distribution of 
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choices in a probability learning task is 
S’s ability to overlook losses. If the 
reward has significance for S, he may be 
less likely to overlook loss than when the 
reward has less significance. The S's 
inability to overlook losses, as Goodnow 
suggests, would lead to more variable 
behavior than when S is less concerned 
about the failure of a response to lead to 
reinforcement. Behavior would become 
more variable as S attempts to seek a 
solution to minimize the frequency of 
nonreinforced responses. 


EXPERIMENT III 
The purpose of this experiment was 
to investigate the performance of Ss 
in a probability learning task when 
the probability of reinforcement is 
changed during the course of the 
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experimental trials. Several recent 
studies have been concerned with this 
problem. Estes and Straughan (1954) 
found that Ss matched the stimulus 
probabilities with their choices, re- 
gardless of whether or not Ss had 
prior experience with different stimu- 
lus probabilities in the task. This 
was interpreted as providing support 
for Estes’ statistical learning theory, 
which predicts that after a series of 
trials Ss’ choices depend upon the 
present stimulus probabilities and 
are independent of earlier experience 
with different stimulus probabilities. 
On the other hand, Parducci (1957), 
using different shifts in probability 
from those employed by Estes and 
Straughan, found in two experiments 
that Ss differed in their final perform- 
ance as a function of an initial experi- 
ence with different stimulus probabi!i- 
ties. Differences in terminal per- 
formance were also found by Goodnow 
and Pettigrew (1955) in a “gambling”’ 


task involving several shifts in prob- 
abilities of reinforcement during initial 


periods of training. The Goodnow 
and Pettigrew study was designed to 
determine whether a number of shifts 
in the probabilities and patterns of 
reinforcement would have an effect 
upon the strategies employed in 
the solution of subsequent problems. 

The purpose of the present study 
was to investigate the effects on 
performance of a wide range of 
systematic changes in stimulus prob- 
abilities. A three-stimulus task was 
used in this study in order to investi- 
gate the effects of prior probability 
of reinforcement and to determine 
some of the characteristics of the 
variability of response found in this 
type of task. 


Method 


Subjects.—The Ss consisted of 225 elemen- 
tary school children ranging in CA from 7.5 to 
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10.3 yr.4 All Ss within these age levels 
attending the schools visited were tested. 

Apparatus.—The apparatus was the same 
as that used in Exp. I and II. 

Procedure.—The general procedure was the 
same as that used in Exp. I and II and rein- 
forcements were the same as those used in 
Exp. I. 

The procedure differed from that in the 
previous experiments in that different rein- 
forcement probabilities were used. During the 
first 40 trials Ss received either 0%, 33%, 
66%, or 100% reinforcement of the correct 
response. During the last 40 trials Ss either 
continued to receive the percentage of rein- 
forcement obtained during the first 40 trials, 
or were switched to a different percentage of 
reinforcement. The two sets of 40 trials 
were presented in an uninterrupted sequence. 
The same knob was correct for a particular S 
throughout the experiment; only the per- 
centage of choices on which this knob was 
reinforced was changed. This procedure re- 
sulted in a 4X4 design containing all 
combinations of initial and final percentages 
of reinforcement. For example, Ss receiving 
33% reinforcement during the first 40 trials 
could receive either 0%, 33%, 66%, or 100% 
reinforcement during the last 40 trials, de- 
pending upon the condition to which they 
were assigned. Fifteen Ss were assigned at 
random to all but one cell of this design. One 
cell, 0% to 0%, was not filled. It was felt to 
be impractical to test children for 80 trials 
during which no reinforcement was received. 
Previous experience indicated that it is diffi- 
cult to maintain children’s motivation during 
protracted periods of 0% reinforcement. 
Among Ss receiving 33%, or 66% reinforce- 
ment, the trials on which a choice of the 
correct knob was reinforced were determined 
by 10 random sequences of 40 trials. These 
sequences were randomized among Ss tested 
in these conditions. 


Results 


When a contingent procedure of 
reinforcement is used in conditions 
involving other than 0% and 100% 
reinforcement it is impossible to be 
assured that Ss will receive exactly 
the designated percentage of rein- 
forcement of correct responses. In 
the present experiment, for example, 
the proportions of correct responses 


5 See Footnote 2. 





PROBABILITY LEARNING 





oO 35% -0% 
e——-—©@ 66% - 0% 
o——0 190% - O% 


9 


I] | 


TT | 


—-nNW SY OD WN 





ou 2a 











-NW Soon @ WO 








° 


| 2 3 a 
BLOCKS OF TEN TRIALS 





°o 


! 
BLOCKS OF TEN TRIALS 





MEAN CORRECT RESPONSE 


—-NW &FUoON @ WO 


o———@ 53% - 66% 
o——-O 66% 66% 
Oo——-O 100% - 66% 


VS eee ees 











TTT] heehee Fa 


o—® 0% -100% 
o-——@ 555° 100% 
o————O 66% 100% 
Oo—- — —O 100% - 100% 


Dna 








°o 


| 2 3 4 
BLOCKS OF TEN TRIALS 


Fic. 3. 





2 3 4 
BLOCKS OF TEN TRIALS 


Mean number of correct responses in blocks of 10 trials during 


Trials 41-80: Exp. ITI. 


on which Ss in the 33% and 66% 
conditions were reinforced were .35 


and .64, respectively. The designa- 
tion of groups as 33% or 66% does 
not, therefore, represent exactly the 
proportion of trials on which rein- 
forcement was received. 

During the first 40 trials the 
average numbers of choices of the 
reinforcing stimulus made under con- 
ditions of 0%, 33%, 66%, and 100% 
reinforcement differed from each other 
consistently. An analysis of variance 
of these data reveals a highly sig- 
nificant difference among the groups 
(F=92.8, df=3 and 221, P<.001). 

By the end of the first 40 trials, 
Ss in each group appeared to be 
approaching an asymptotic level of 
response. The question of whether 


an asymptote was reached may be 
answered by determining whether 
there was a significant change in 
performance between Trials 21-40 
and 61-80 by groups for whom the 
probability of reinforcement remained 
constant throughout the experiment. 
The differences between Trials 21-40 
and Trials 61-80 in the average 
numbers of choices of the reinforcing 
stimulus are not significant at the 
.05 level (F = 3.83, df = 1 and 42). 

The results for Trials 41-80 are 
presented in blocks of 10 trials in 
Fig. 3. In this figure Ss are classified 
according to the percentage of rein- 
forcement received on Trials 1-40 
and 41-80. For example, the nota- 
tion, 33%-0%, indicates that this 
group received 33% reinforcement 
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of choices of the correct response 
during Trials 1-40 and 0% reinforce- 
ment during Trials 41-80. 

In Fig. 3 the frequencies of choice 
of the reinforcing stimulus appear 
to vary both with the stimulus prob- 
abilities of Trials 41-80 and with the 
stimulus probabilities of Trials 1-40. 
It is clear from Fig. 3 that there are 
differences in the average number of 
choices of the reinforcing stimulus 
by Ss receiving 0%, 33%, 66%, and 
100% reinforcement during the last 
40 trials. During the terminal trials 
there is an increase in the average 
frequency of choice of the reinforcing 
stimulus as the proportion of rein- 
forcement received during these trials 
increases from 0% to 100%. It is 


also apparent that there are dif- 
ferences among the groups receiving 
the same percentage of reinforcement 
during Trials 41-80. These differ- 
ences are related to the percentage 
of reinforcement received during Trials 


1-40. No simple relationship be- 
tween the performance on the last 
40 and the first 40 trials is apparent. 
There was no consistent tendency 
for Ss changed from high to low or 
from low to high probabilities of 
reinforcement to fall above or below 
the level of response shown by Ss 
who had consistently been reinforced 
at the level to which Ss were shifted. 
There is some tendency ‘for the 
groups receiving 33% and 66% rein- 
forcement during the last 40 trials 
to be ordered according to the per- 
centage of reinforcement received 
during the first 40 trials. In the 
groups receiving 0% and 100% rein- 
forcement during the last 40 trials 
there is no tendency for the groups 
to be ordered in any fashion, and 
terminal levels of response in all 
groups are quite similar. 

To determine whether the differ- 
ences are statistically significant, an 
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analysis of variance of performance on 
Trials 61-80 was performed. For 
this analysis Ss were classified accord- 
ing to the probability of reinforcement 
under which they had been tested on 
Trials 1-40 and 41-80. A significant 
difference among the groups as a 
function of level of reinforcement dur- 
ing Trials 41-80 was found (F = 57.2, 
df =3 and 210, P < .001). (The 
entry for the 0%-0% cell in the 
analysis of variance was selected at 
random from one of the other cells 
in which 0% reinforcement was re- 
ceived during Trials 41-80.) A sig- 
nificant difference was also obtained 
as a function of the levels of rein- 
forcement Ss had received during 
Trials 1-40 (F = 3.96, df = 3 and 
210, P < .01). The interaction term 
was not significant (F = 1.17, df = 9 
and 210, P > .05). It is concluded, 
therefore, that performance on Trials 
61-80 differed significantly, both as a 
function of current and previous 
reinforcement probabilities. 


A question of recent interest is the 
relative frequency with which Ss choose 
nonreinforced stimuli in a probability 
learning task (Anderson & Grant, 1957). 
An analysis of the distributions of choices 
by Ss in the present study was made 
possible by the use of a three-stimulus 
task. The data from all conditions 
were combined, since the stimulus which 
was reinforced was randomized among 
all conditions. When the left knob was 
correct, the middle knob was chosen on 
an average of 20.7 trials and the right 
knob on an average of 18.5 trials. The 
difference in the average frequency with 
which the two knobs were chosen is 
significant at beyond the .001 level 
(t = 4.64, df = 85). When the middle 
knob was correct, the left knob was 
chosen on an average of 14.7 trials, and 
the right knob on 16.0 trials (¢ = 3.59, 
df = 72, P <.001). When the right 
knob was correct, the left knob was 
chosen on an average of 16.7 trials, and 
the middle knob on 20.8 trials (¢ = 6.96, 
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df = 65, P < .001). The Ss, therefore, 
did not distribute their choices of non- 
reinforced stimuli equally. The tend- 
ency to prefer the middle knob was also 
found when the initial response of each S 
in this experiment was examined. On 
Trial 1, 56% of Ss chose the middle knob, 
23% chose the right knob, and 21% chose 
the left knob. These tendencies were 
not found, however, when the total 
frequencies of choice of the reinforcing 
stimulus were considered. Individual ¢ 
tests comparing the proportion of times 
the correct stimulus was chosen under 
conditions where the left, middle, and 
right stimuli were correct indicate no 
significant differences (¢ < 1.41). 

These results are consistent with those 
of experiments showing that the terminal 
level of response in a probability learning 
task is affected by earlier experience with 
other stimulus probabilities. The re- 
sults do not support the conclusion of 
Estes and Straughan that asymptotic 
level of response is independent of prior 
experience with different probabilities of 
reinforcement. The significance of the 
present finding for Estes’ theory is not 
clear, for Estes has not made predictions 
concerning the three-choice case with 
contingent reinforcement of only one 
response. The problem facing a statis- 
tical learning theory in making such a 
prediction is that of determining some 
basis of distributing the effects of a non- 
reinforced trial between two incorrect 
stimuli. The present study provides 
evidence related to this problem: Ss did 
not distribute their choices equally be- 
tween the two nonreinforcing stimuli. 
The results indicate a hierarchy of re- 
sponse tendencies such that one of the 
nonreinforcing knobs was more likely to 
be chosen than another. 

Even though the results indicate that 
previous experience does affect final level 
of response, their similarity to those 
found in other probability learning tasks 
is striking, in that the level of response 
was dependent on the probability of 
reinforcement provided by the reinforc- 
ing stimulus and in that an asymptotic 
level of response was rapidly reached. 
The high dependence of the response 
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level upon probability of reinforcement 
was seen in the rapid changes in perform- 
ance that occurred following the change 
in probability of reinforcement. The 


rapidity with which the change in level of 
response occurred following the change in 
reinforcement indicates that Ss readily 
the change in 


discriminated 
probability. 


reward 


SUMMARY 


Three experiments are reported concerning 
the effects of differences in CA, incentive, and 
prior experience with different probabilities 
of reinforcement on children's performance in 
a probability task. 

In Exp. I, Ss at CA’s 3, 5, 7, and 9 were 
tested in a three-stimulus situation with 
reinforcement of only one of the three re- 
sponses. Reinforcement was delivered on 
33%, 66%, or 100% of Ss’ choices of the 
correct stimulus. An inverse relationship 
between frequency of choice of the reinforced 
stimulus and CA was found in the 33% and 
66% groups. No differences appeared among 
the age groups with 100% reinforcement. 

In Exp. II, Ss at CA 5 were tested under 
the same conditions employed in Exp. I with 
incentives of high or low value. In the 66% 
and 33% groups the frequency of choice of 
the reinforcing stimulus was significantly 
lower for the high incentive than for the low 
incentive groups. With 100% reinforcement 
no difference between the high and low in- 
centive groups in the frequency of choice of 
the reinforcing stimulus was obtained. 

In Exp. III, elementary school children 
were tested in a three-stimulus situation with 
reinforcement of only one of the three re- 
sponses. The Ss received either 0%, 33%, 
66%, or 100% reinforcement of the correct 
response for 40 trials. During a second 
series of 40 trials Ss either continued under 
the same reinforcement schedule, or were 
switched to one of the other levels of rein- 
forcement. All combinations of initial and 

«terminal reinforcement except the 0%-0% 
condition were used. Performance on the 
first 40 trials differed significantly among the 
groups and the groups were ordered according 
to the percentage of reinforcement received. 
The same ordering of the groups according to 
current percentage of reinforcement was found 
during the last 40 trials and again a significant 
difference in incidence of choice of the correct 
stimulus was found. During the last 40 
trials a significant difference among the groups 
was also found when the groups were classified 
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according to the percentage of reinforcement 
received during Trials 1-40. Percentage of 
reinforcement received during Trials 1-40 did 
not, however, affect performance in all groups 
on Trials 41-80 in any consistent fashion. 
There was a tendency for the performance of 
groups receiving 33% and 66% reinforcement 
during the last 40 trials to be ordered accord- 
ing to the percentage of reinforcement re- 
ceived during the first 40 trials. No such 
tendency was found for the groups receiving 
0% and 100% reinforcement during the last 
40 trials. 
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SOME FACTORS IN PROBABILITY MATCHING 
IRVIN RUBINSTEIN 
Walter Reed Army Institute of Research 


One of the results to emerge from 
two-choice learning situations has 
been the tendency of Ss to choose 
in such a way as to make their per- 
centage of choices of an item match 
the percentage of occurrence of that 
item (cf. Hake, 1955). This has 
been called probability matching. 
The situation presented to S might be 
a series of cards—some blue, some 
yellow—arranged in random order 
with the constraint that blue occurs 
75% of the time, while yellow occurs 
25% of the time. The S is expected 
to predict the color of the cards as 
they are turned up one by one. The 
S, after a reasonably long series of 
trials, will begin and continue to 
predict blue 75% of the time, perhaps 
a bit higher or lower. 

It has been argued that probability 
matching is strange behavior, and 
that S would achieve a best solution 
to the problem if he picked the more 
frequent event 100% of the time. 
Flood (1954) has pointed out, how- 
ever, that the optimal solution as- 
sumes that S becomes aware that the 
events form a random series, but 
few experiments make explicit the 
randomness and the impossibility of 
a complete solution. As an alterna- 
tive explanation for the optimal solu- 
tion not appearing, it has been argued 
that Ss place an inordinate value on 
correct prediction of the infrequent 
event (Goodnow, 1955; Grant, Hake, 
& Hornseth, 1951). 

The present experiment is a test 
of these two possibilities. For the 
effect of awareness of randomness, 
the series of events is generated 
before S, so that each event can be 
seen as clearly independent. For 


the effect of the infrequent event, S 
is presented with one frequent event 
and three infrequent events so that 
there is less chance of being correct in 
predicting an infrequent event. 


METHOD 


Design and material.—F or three groups of 
Ss, the more frequent event occurred 67% of 
the time. In all three groups, Ss were asked 
to predict on each of a series of 126 trials 
which of the alternative events would occur, 
and were instructed to aim at getting as 
many correct predictions as possible. 

Group A, essentially a control group, was 
given a conventional two-choice task. The 
material consisted of 126 cards—84 blue and 
42 yellow. The cards were randomly ordered 
on the length of runs; ie., the run-lengths 
expected on a chance basis were determined 
and these run-lengths then randomized for 
order. 

For Group B, the same stimulus series was 
used, except that 14 yellow, 14 orange, and 
14 white cards were used instead of the 42 
yellow. 

For Group K, three playing cards were 
used, two Kings of Hearts and one Jack of 
Spades, out of a regular deck of playing 
cards. These cards were shuffled before each 
trial, and one of the three cards was chosen by 
a member of the group of Ss. 

Procedure.—All Ss were tested in the same 
room, in groups of approximately 12 to 15 
men. Responses were written on pads with 
numbered pages, one response to a page, and 
performance was monitored by either one or 
two assistants. 

For Groups A and B the stack of cards was 
on E's desk, but hidden from the view of the 
Ss. The Ss were told the colors that could 
occur, and were instructed to write their first 
prediction in the middle of the first page. 
When this was done, E raised the card so that 
it could be seen by the Ss and at the same time 
called out itscolor. The Ss then immediately 
turned the page and made the next prediction. 
This procedure was continued in a regular 
rhythmic manner taking approximately 4 
sec. between presentations. 

In Group K, Ss were shown the cards that 
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TABLE 1 


MEDIANS AND MEANS, IN PERCENTAGES, OF THE SELECTION OF THE 67% STIMULUS 








Trials 





43-84 





Median 





51.19 
47.62 
80.95 

















could occur (two Kings of Hearts and one 
Jack of Spades) in each of the two sets of 
cards that were used. One S was used as an 
aide in order to impress Ss with the fact that 
cards would be picked on a chance basis. 
Two Es and the aide took part in the presen- 
tation of cards. All three were seated at the 
desk in front of the room. The £, shuffled 
one set of the cards, handed them to E:2, and 
she (£,) shuffled the alternate set. The 
E, laid the cards down separately on the desk, 
and the aide chose one by pointing at it. Al- 
though Ss knew what was going on and 
actually observed all of the steps, they were 
unable to see the cards as they lay on the desk. 
After the aide chose a card, Ss (who had the 
same type pads as the other groups) were 
asked to make their choices. The EZ, then 
picked up the chosen card and showed it to 
Ss, at the same time calling out what it was. 
Then he took the newly shuffled set from E, 
and put those cards on the table while E, 
shuffled the set used in the first trial. (This 
continued for 126 trials.) 

Subjects.—All Ss were drawn from enlisted 
military men at the Walter Reed Army 
Medical Center: 44 Ss for Group A, 41 Ss for 
Group B, and 37 Ss for Group K. 


RESULTS 


For the analysis the 126 trials were 
split into thirds giving Trials 1-42, 
43-84, and 85-126 as separate units. 
Table 1 gives the median percentages 
of the selection by the three groups 
of the 67% stimulus. In A and B 
that would be blue, while in K it 
would be “King.” Group K is seen 
to go well above a matching solution 
from the very beginning, starting 
out with 80.95% and ending up with 


78.57%. In all cases it is well above 
the other two groups. Group B 
starts off quite low, but ends up some- 
what above a matching solution. 
Group A follows the expected pattern 
of events starting at 51.19% and 
ending up near a matching solution. 

Median tests (Mood, 1950) were 
done and the resulting xs and P 
levels are shown in Table 2. Group 
K is significantly different from the 
other two groups at almost all points. 
In particular, when Group K_ is 
compared to Group A in the last 
third, P < .01. 

The median test shows no signifi- 
cant differences between Groups A 
and B. However, over the last 42 
trials the distributions (Fig. 1) of 
Groups A and B seem to be different 
with B having the greater range. 
In Group B, four Ss have selected 
the 67% event 40 to 42 times out of a 
possible 42, while one has selected 
it only once or twice. Group A has 
no Ss at these extremes. In general, 
as compared to Group B, Group A 
is more homogeneous, showing a 
stronger tendency to cluster normally 
about the mean. Some further in- 
dication that Group B’s distribution 
does differ from a theoretical distribu- 
tion more markedly than Group A’s 
is seen in that when Group B’s 
distribution over the last 42 trials 
is compared to the binomial theo- 
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TABLE 2 


x*’s AND P LEVELS FOR THE THREE 
Groups (MEDIAN TEsT) 








Groups 





1-42 


| 
| 





K and A |x?= 40.101 
” P<.001 





K and B |x?= 34.644 
P<.001 


A and B |x?= 2.597 
P<.20 














retical distribution with 6 df the x’ is 
45.434, giving a P value less than 
.000001, while the x’? of Group A’s 
when compared to the theoretical is 
11.869 giving a P value between .06 
and .08. 


DISCUSSION 

It has been pointed out (Edwards, 
1956) that when the concept of ‘match- 
ing solution”’ is applied to a group it does 
not account for many individual be- 
haviors within the group. It is apparent 
from Fig. 1 that the individual differences 
within Group B are more difficult to 
account for in terms of the one concept 
of ‘‘matching’”’ than are those of Group 
A. The greater range and bimodality of 
Group B's distribution suggest this. 
The factors causing it are not obvious. 
Perhaps the use of four colors in Group B 
as compared to two in Group A makes for 
a more complex situation where more 
theoretical solutions are possible to the 
subject. On the one hand, there is some 
clustering near the optimal solution 
which might have been caused by a 
“theory” of excessive failure on the not 
blue side. Secondly, there is some clus- 
tering about a matching solution. And 
thirdly, two Ss are very low, as though 
they were waiting for the not blue colors 
to make an appearance, operating per- 
haps under what has been known as the 
‘“‘gambler’s fallacy.” It may well be, 
then, that while the median test showed 
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Fic. 1. Distribution of Ss over the last 42 
trials according to the number of the 67% 
stimulus choices. 


no differences between Groups A and B, 
that the effect of B was more to strengthen 
any particular stratagem that S might be 
inclined to use. 

It is clear that marked differences may 
be obtained from Ss responding to a 
randomly generated series of the two- 
choice type depending upon the manner 
in which the series is generated. If the 
series is generated before S, so that it is 
clear from the beginning that the stimuli 
are to appear randomly, S does not seek 
a matching solution but goes well above 
it. Furthermore, no lengthy series of 
trials is needed since he reacts this way 
from the very beginning. On the other 
hand, since the optimum is rarely 
achieved (only three out of 37 Ss chose 
“King” for the whole 126 trials) S must 
still be using other devices. At times 
after the experiment S will ask if this was 
an experiment in mental telepathy. 
This belief occurs often in experiments 
of the type discussed in this paper and 
would certainly lower the chance of an 
optimal solution to K. We do not wish 
to exhaust the factors involved, but 
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simply to indicate that S will often 
interpret the task to his own choosing in 
a way that is simply not of a problem- 
solving nature, 

It has already been suggested (Good- 
now, 1955) that an important variable is 
the way S conceptualizes the task. Ina 
gambling situation in which money was 
involved more Ss reached optimal solu- 
tions. Morin (1955) found that when S 
was given a signal to show when the 
machine was “‘lying’’ (giving the low 
percentage stimulus) he went more 
toward an optimal solution. In the 
current study the effect of different 
conceptualization is clear. In the K 
experiment S did not have to contend 
with the solution of a serial problem, but 
only with dealing with probabilities. 
Does this throw any light on why he 
does not go toward an optimum solution 
in experiments of Type A? It may be 
that he never tires of searching for a 
solution where there is the faintest 
possibility that one exists. What this 
means is that the random effects that the 
experimenter creates does not reach the 
subject. ‘‘Randomness”’ as a solution to 
a problem may not be acceptable to a 
subject who is accustomed to believing 
that if he is asked to solve a problem, the 
problem is solvable. Such a limitation 
would seem to be cultural, and within it, 
a matching solution is a good-one. 


SUMMARY 


Three experiments were performed of a 


two-choice nature. In Group A two differ- 
ently colored cards were presented randomly 
with the restraint that blue occurred 67% of 
the time while yellow occurred 33% of the 
time. Group B used the same series as 
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Group A, but where only yellow was used 
with Group A, three different color cards were 
used. In a third Group, K, a series of 126 
stimuli (as in the others) was generated before 
S by the use of three playing cards. The 
main results were as follows: (a) The S chose 
the 67% stimulus a significantly greater 
number of times in Group K than in Groups 
A or B. (6) While Group B did not differ 
significantly from Group A in the number of 
choices of the 67% stimulus, comparison of 
their distributions suggests differential effects. 
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